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CHAPTER 1 
Mathematical Induction 


1. The induction principle 


Suppose that we want to prove that 
“P(n) is true for every positive integer n”, 

where P(n) is a proposition (statement) which depends on a positive integer n. Proving P(1), 
P(2), P(3), etc., would take an infinite amount of time. Instead we can use the so-called 
induction principle: 
Induction Principle. Assume that k is an integer and P(n) is a proposition for all n > k. 

(1) Suppose that P(k) is true, and 

(2) for any integer m > k for which P(m) is true, P(m + 1) is true. 
Then P(n) is true for all integers n > k. 


The induction principle can be compared to an infinite sequence of dominos tiles, num- 


bered 1,2,3, etc. 


1 2: 3 4 5 6 7 
If the m-th domino tile falls, it will hit the (m+ 1)-th domino tile and the (m+ 1)-th 
domino tile will fall as well. If the first domino tile falls, then all domino tiles will fall down. 
(Here P(n) is the statement: “the n-th domino tile falls down”) 


// 
yf | 


1 2 3 4 5, 6 7 
Since the induction principle is intuitively clear, we will simply accept it without proof. 
This is why it is called an axiom. (We cannot formally prove the induction principle without 
making other, similar assumptions.) 
A typical example of the induction principle is the following: 


EXAMPLE 1.1. Prove that 


n(n +1) 


1 14+24+34+--+n= 
(1) 3 n ; 


for every positive integer n. 
PROOF. We prove (1) by induction on n. For n = 1 we check that 


poh@+y 
= 


3 


Suppose that (1) is true for n =m. Then 
1+24+---¢m+(m4+1) =(1424+---+m)+ (m+) = 
_ m(m + 1) Ge th (m+ 1)(m+ 2), 


2 2 
so (1) is true for n = m+ 1. Now (1) is true for all positive integers n by the induction 
principle. 


REMARK 1.2. When the German mathematician Carl Friedrich Gauss (1777-1855) was 
10 years old, his school teacher gave the class an assignment to add all the numbers from 1 
to 100. Gauss gave the answer almost immediately: 5050. This is how (we think) he did it: 
Write the numbers from 1 to 100 from left to right. Write under that the numbers from 1 
to 100 in reverse order. 


1 2 3 --- 100 
100 99 98 --- 1 
101 101 101 --- 101 
ee 

100 


Each of the 100 column sums is 101. This shows that 
2-(1+2+---+100) = 100-101 


and 


100-101 
Dp Gest ers a OU—aay  See 


This easily generalizes to a proof of (1). Gauss’ proof can be graphically presented. For 
example, to see that 


2-(1+2+---+10) = 10-11, 


look at the following picture: 


ms 
Oo 


at 
al 9 
3 | 8 
: |4 | 7 
i (5 | 6 
10 6 | 5 
7 4 
8 3 
9 2 
10 1 


A formula similar to (1) exists for the sums of squares, namely 


(2) Piety cy ge MEE Pay) 
G : 


EXAMPLE 1.3. Give and prove a formula for 
1? oe)? Avast? 
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We have seen similar examples, namely (1) and (2). We can also add the formula 


DP Be Sa, 


Let 
p(n) = 1 + 2% + 3% 4...4+n% 

where k € N. The examples so far suggest that p,(n) is a polynomial of degree k+1 (and that 
the leading coefficient is ea Let us assume that p3(n) is a polynomial of degree 4. Since 
p3(0) is an empty sum, we have that p3(0) = 0. We can write p3(n) = an* + bn? + en? + dn. 
for certain real numbers a,b, c,d. We have 
(3) 

n® = p3(n) —p3(n—1) = a(n*—(n—1)*)+0(n3 —(n—1)*) +e(n? —(n—1)?)+d(n—(n—1)) = 

= a(4n? — 6n? + 4n — 1) + 0(8n? — 8n +1) + c(Q2n—1) +d 
= n3(4a) + n?(—6a + 3b) + n(4a — 3b + 2c) + (-a +b —c+d) 


Comparing coefficients in (3) gives us the linear equations: 


(4) 1 = 4a 

(5) 0 = —6a+3b 

(6) 0 = 4a-—3b+ 2c 
(7) O = Geb =o 


We solve the system of equations and find a i, b , c : and d = 0. We now should 
conjecture the following formula: 


P42 +.--+nF = 


Finding this formula was the hard part. It is now not so hard to prove this formula by 
induction: 


TA ope. 1,,2 
fils Foote ats 


PROOF. We will prove that 


(8) 19+ 2° +--+. +n? = Ent + 5n3 + $n? 


by induction on n. The case n = 0 is clear, because both sides of the equation are equal to 
0. If (8) is true for n = m — 1, then 


From this follows that 

184+ 2% +4---+(m—1)? +m? = t(m—1)4+ $(m— 1)? + t(m-1P? +m = 

= 7(m* — 4m? + 6m? — 4m + 1) + (m3 - 
4 


243m —1)+4(m? -2m+1) +m = 


4 4 


= 4m | 


so (8) is true for n = m. By induction follows that (8) is true for all n € N. 


Notice that 


tm! + $m? + tm? = ($n(n 4+ 1))? 
which leads to the following aesthetic formula: 
422 4---+n%3=(14+24---+n)?. 
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EXAMPLE 1.4. What is the value of 


ee ae ae 


1-2 2-3 3-4 


Let us compute the partial sums. Perhaps we will find a pattern. 


Th te ly Ae 
ae es ae cae es 
SP 2 ee ee 
i ee oe ee ee a ee ee me 
1 1 1 it Sas 1h. “I, = 16. 


{oo O60 G24 > dee. A 90 “90. 0: 0. G 
A pattern emerges. Namely, it seems that 


i! 1 1 1 
(9) petyal se ; 
1-2 2-3 n(n + 1) age 
PRooF. By induction on n we prove: 
1 1 1 1 
10 | bee = 
(10) 1-2 2-3 n(n + 1) n+1 
For n = 1 we check 
1 1 
ae, 
Ls 2 2 
If (10) is true for n = m, then 
Die ead 1 1 = 
he2" 3. ~m(m+1)° (m+1)(m+2) 
1 1 1 il 
nae Hep le We? neg 
Hence (10) is true for n = m+ 1. By induction, (10) is true for all integers n > 1. We have 
1 1 1 1 
! ! fe: = lim (1- = 
(2, Das SA aad 


EXAMPLE 1.5 (UMUMC, 1988). Let S,, be the set of all pairs (x,y) with integral coor- 
dinates such that x > 0, y > 0 and x+y <n. Show that S,, cannot be covered by the union 
of n straight lines. 


First we should try a few small cases, say n = 0,1, 2,3, 4: 


e e.e 
e eo eee 
e e ee — eo @-@ 
e ee ee oe. e-0 ee, eoeeee 
n=0 n=1 n=2 n=3 n=4 


Notice that S,, is a subset of S,,4;. This will be helpful for our induction proof: 
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PROOF. We prove the statement by induction on n, the case n = 0 being trivial. Suppose 
that one needs at least n + 1 lines to cover S,. Define Cyi1 = Snyi \ Sn. The set Cri 


consists of n + 2 points on the line «+ y= n+ 1. Suppose that & lines ¢;, 0.,...,€, cover 
Snail 

case 1: One of the lines is equal to the line x + y=n+1. Without loss of generality we 
may assume that ¢; is equal to the line x+y =n+1. Then (1, f2,..., 2,1 cover S, because 


ly AS, =. From the induction hypothesis follows that k -—1>n+1,sok>n+2. 
case 2: None of the lines are equal to the line x+y = n+1. Then each of the lines intersects 
the line x + y = n+ 1 in at most one point, and therefore it intersects the set Cy+1 in at 
most one point. Since C’,,, has n + 2 elements, there must be at least n + 2 lines. 

So in both cases we conclude that one needs at least n+ 2 lines to cover S,,41. 


2. Strong Induction 


The following example illustrates that sometimes one has to make a statement stronger 
in order to be able to prove it by induction. 
EXAMPLE 1.6. Prove that 
Pg Oe 999, 999 ee 1 
2: Al G 1,000,000 1000 
Since 1000 = \/1,000, 000 one might suggest that 
13 9 aw—Il 2 1 
24 6 2n J2n 
for all n > 1. Let us try to prove (11). We can check (11) for small n (which gives some 
validity to our conjecture that this inequality holds). Suppose that (11) holds for n = m: 
13 59 2%m-1 2 1 
2 4 6 2m V2m 
We have to prove (11) forn =m+1: 
Peas Sera ey 1 
2: Ae 66 2m + 2 2m +2 
If we divide (13) by (12) we obtain 


(11) 


(12) 


(13) 


2m+ 1 ¥ 2m 
2m+2~— V 2m+2 
If (12) and (14) are true, then (13) is true. By squaring (14) we see that (14) is equivalent 


to 
Im+1\? 2m 
< 
2m +2 ~ 2n+2 
and to 


(15) (2m + 1)? < (2m + 2)(2m) 


So if (15) is true then our induction proof is complete. Unfortunately (15) is not true and 
we are stuck. 
Sometimes it is easier to prove a stronger statement by induction: 


(14) 
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PROOF. We prove 


(16) Ee see 1 
Do od 2n fant 
by induction on n. The case n = 1 is clear because 
eee 
a es 
Suppose that (16) is true for n = m: 
1 3 2m — 1 1 
(17) Da pe Gey 


Since 
(2m + 1)(2m + 3) = (2m + 2)? -1< (Qm+2)? 


Mm+1 v2 mMm+1 
2m+2 2m4+3 
and 


(18) ee pales 
2m + 2 2m +3 

Multiplying (17) by (18) yields 
13 ey 2m+1 Z 1 
24 2m+2 2m+3 
so (16) is true for n = m+ 1. This shows that (16) is true for all positive integers n. In 
particular, for n = 500,000 we get 

1 3 999, 999 2 1 e 1 

24 1,000,000 ~ ./1,000,001 ~ 1000 


we have that 


(19) 


d 


Below is a trickier proof of Example 1.6. 


PROOF. Let 
he. 1-3-5---999, 999 
2-4-6---1,000, 000 
and 
pa 2:4:5--- 1,000, 000 


3-5-7---1,000, 001° 
Clearly A < B because 
28 is 4 999, 999 e 1, 000, 000 
2 ~3'4 ~ 5’? 1,000,000 ~ 1,000,001 


It follows that 
1 1 


A? <AB= ee 
1,000, 001 1, 000, 000 


and A < 10007!. 


EXAMPLE 1.7. Prove that every integer n > 2 is a product of prime numbers. 
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Proor. Let Q(n) be the statement: 


“every integer r with 2 <r <n is a product of prime numbers.” 


We use induction on n to prove that Q(n) holds for all integers n > 2. 

For n = 2 the statement is true because 2 is a prime number. Suppose that Q(m) is true. 
We will prove Q(m+1). Suppose that 2<r<m-+1. Ifr <m then r is a product of prime 
numbers because Q(m) is true. Suppose that r = m+ 1. If m+ 1 is a prime number, then 
m+ 1 is a product of prime numbers and we are done. Otherwise, m+ 1 can be written as 
a product ab with 1 < a,b <_m. Because Q(m) is true, both a and b are products of prime 
numbers. Hence m+ 1 = ab is a product of prime numbers. 

We have shown that Q(n) holds for all n > 2. In particular, every integer r > 2 is a 
product of prime numbers because Q(r) is true. 


3. Induction in definitions 


We can also use induction in a definition. For example, the Fibonacci numbers is a 
sequence of numbers Fo, F,, Fo,... defined by Fp = Fy = 1 and 


Pea = Fe ats (ease 


By (strong) induction on n we can prove that F,, is well-defined for all integers n > 0. The 
first few Fibonacci numbers are: 


1,1, 2,3,5,8, 13, 21, 34, 55,89,... 


The sum notation is an example of a recursive definition. Suppose that f(n) is some 
function. If a,b are integers and a < b+ 1 then we define 


S> f(n) 


as follows. 7 
> a= 
and _ 
b b-1 
(20) SY" f(n) = fO) + >— fn) 
ifb> a. 


One can then formally prove by induction that 


S~ fin) = $7 f(r) + SD f(n). 


n=b+1 


ifa,b,c€ Zanda—1<b<ce. (Induction on c. Start with c = b.) 
Similarly we have the product notation. 


TL f@) =1 
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and 


[[f@ =10 TL ro. 


ifb> a. 
For nonnegative integers m and n with m < n we define a binomial coefficient by 
(”) 1 ifm =Oorm=n 
m aed - (x) im <n 


If we arrange the binomial coefficients in a triangular shape, we get Pascal’s triangle: 


4. Exercises 


EXERCISE 1.1. * Prove that 
n(n + 1)(2n + 1) 


i ga ee 
6 
for all positive integers n. 
EXERCISE 1.2. * Show that 
fe 1 1 1 _ 1 1 ' 1 
2B Op OA nL WD Qn 


for alln EN. 
EXERCISE 1.3. * Prove that 
5. () _ een 
= m Mish 
for all nonnegative integers m and n. 


EXERCISE 1.4. * Prove that 


(a +b)" = (5) a? (") a? 4 (5) ae ee (") B”. 


EXERCISE 1.5. * 


(a) Prove that 
ntl =e 7] 


ca 
for every real number x and every positive integer n. 
(b) If x is a real number with |x| < 1 then 


1 
1 ge : 
1-2 


EXERCISE 1.6. * Show that the sum of the squares of two consecutive Fibonacci numbers 
is again a Fibonacci number. 


EXERCISE 1.7. ** Cut out a 1 x 1 corner of a 2” x 2” chess board (n > 1). Show 


that the remainder of the chess board can be covered with L-shaped tiles (see picture). 
2 


1 


The case n = 2 is shown below. 


EXERCISE 1.8. ** Find and prove a formula for 
Icio is ergo iad 
EXERCISE 1.9. ** Show that 


as follows: Define 


fv) =(1+2)" = (5) + (Tete Ge 


and consider f(*)(0) (f(a) is the k-th derivative of f(x)). 
EXERCISE 1.10. ** Prove that 
n l _ 
fie —= <2V/n 
vn d. Gi < ev 


for all integers n > 2. 


EXERCISE 1.11. ** Define a sequence aj,d2,... by a; = 2 and dn41 = a2 —2 forn>1. 
Give an explicit formula for a, and prove it. 
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EXERCISE 1.12 (Division with remainder). ** Suppose that n is a nonnegative integer, 
and m is a positive integer. Prove that there exist integers q and r with n = qm+~r and 
O<r<m. 


EXERCISE 1.13. ** Give and prove a formula for 


1 1 : 1 
fees “Bese Oo GEA) s GD), 

EXERCISE 1.14 (Expansion in base b). *** Suppose that n is a positive integer, and b is 
an integer > 2. Show that there exist an nonnegative integer m, and integers do, a1,...,@m € 
{0,1,2,...,b—1} such that 
(21) = Gmb™ Hama0 aor + a 
and am 4 0. Moreover, show that m and ag, a1,...,@m are uniquely determined by n. (We 


will write (@m@m_1°++@0)» for the right-hand side in (21)). 

EXERCISE 1.15. *** Suppose that n is a positive integer. Show that we can write 
where & is a positive integer, 1 < 7; and 2; > ij; + 2 for 7 = 2,3,...,k. Also show that k 
and 71,...,7%% are uniquely determined by n. 

EXERCISE 1.16 (Putnam 1985, B2). *** Define polynomials f,,(xz) for n > 0 by fo(x) = 1, 
fr(0) = 0 for n > 1, and 


FUins(2)) = (n+ Dale +1) 


for n > 0. Find, with proof, the explicit factorization of fijoo(1) into powers of distinct 
primes. 


EXERCISE 1.17 (Putnam 1987, B2). *** Let r, s and t be integers with 0 < r, 0 < s and 


r+s<t. Prove that 
CE), A Soe a eke Meta > IE 
t T 


CP ea GE aa): 
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CHAPTER 2 


Extremal Principle 


It sometimes can be very useful to assume that a certain quantity is maximal. We will 
see various examples of this. 


1. The Discrete Extremal Principle 


The (discrete version) of the extremal principle can be formulated as follows: 


Discrete Extremal Principle A real valued function f on a finite set S has a maximum 
and a minimum. 


EXAMPLE 2.1 ((UM)*C'® 2001 2). **** Show that the people at a party can be divided 
into two groups and sent to two different rooms in such a way that, for every person in 
either room, at least half that person’s friends at the party are in the other room. (You may 
assume that friendship is a symmetric relation. 


Proor. Let m the the number of all pairs {P,Q} of people such that P and Q are in 
different rooms and P and Q are friends. We may assume that m is maximal over all possible 
ways of dividing the people in two groups. Suppose some person P has ap friends in his own 
room and 0p friends in the other room. If P would move to the other room then we have 
to add bp — ap to m. By our maximality assumption on m, we get that bp < ap for all P 
which is what we wanted to prove. 


2. The Continuous Extremal Principle 


There is also a continuous version of the Extremal Priciple. This is not quite as obvious. 
It is well known in calculus. 


Continuous Extremal Principle A continuous real-valued function f on a closed interval 
[a,b] CR has a maximum and a minimum. 


THEOREM 2.2 (Rolle’s Theorem). Suppose that f is a real-valued differentiable function 
on an open inverval (a,b) which has a maximum (or minimum) atc € (a,b). Then f'(c) = 0. 


PRooF. By definition 


_ seth) — fle) 
f'(c) = lim 7 


h-0 
In particular 


/ : f{e a h) ZZ f(o) 
— << 
a ett h <0 
because f(c) is the maximal value of f. On the other hand 
h) — 
f'(c) = lim ites ) fle) 


hto 


= 0; 
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This shows f’(c) = 0. 
EXAMPLE 2.3. Show that 


3 
Se eer | 
x 
for x > 0. 


Proor. Let f(r) = 23+ 3/zx. Clearly if 0 < x < } and if z > 2, then f(z) > 4. The 
continuous function f(x) has a minimum on the interval [5,2], say at c. Ifc = § or c = 2 
then f(x) > f(c) > 4 for all x € [3,2]. Assume now that $< c <2. Then we must have 


5) 
f'{c) a a =i) 


by Theorem 2.2. We easily solve this and find c = 1. Now we have 
f(x) 2 f) 24 


for all x € [4,2]. 


An immediate consequence of the previous theorem and the Continuous Extremal Prin- 
ciple is the Mean Value Theorem: 


THEOREM 2.4 (Mean Value Theorem). Let f be a continuous real-valued function on the 
closed interval [a,b] C R which is differentiable on the open interval (a,b). Then there exists 


ac € (a,b) such that 
f(b) ~ f(a) 
b-a — 
a(2) = fa) — f(a) - HOM @ 


Note that g(a) = g(b) = 0. There exists ac € [a,b] with g(c) maximal. If c =a orc=b, 
then g(x) < 0 for x € [a,b] and we can find a c € (a,b) such that g(c) is minimal. In any 
case there is a c € (a,b) for which g(c) is maximal or minimal. From the previous theorem 


follows that 5 
go) = s(¢) - OMY) Lo 


PROOF. Put 


THEOREM 2.5. Suppose f is a differentiable function on an interval (a,b). Then f is 
(weakly) increasing if and only if f(x) > 0 for alla < « < b. (Weakly increasing means 
here that f(a1) < f(ae) ifa<a1< 22 <0.) 


PROOF. Suppose that a < 71 < x2 <b. By the Mean Value Theorem there exists a c in 
the interval (x1, 22) such that 
f (x2) = f (#1) ee f'(o) 
v2 — Ly 
Since f’(c) > 0 and x2 > 21, we have f(x2) > f(x1). 


EXAMPLE 2.6. * Show that sin(x) < x for x > 0 and sin(x) > x for x < 0. Also show 
that cos(z) > 1— $2? for allz ER. 
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PROOF. Consider f(x) = sin(z)—2a. Then f’(”) = cos(x)—1 < 0. This means that f(x) 
is weakly decreasing on R. Since f(0) = 0, we have f(x) < 0 for all x > 0 and f(x) > 0 for 
all x < 0. Now consider g(x) = cos(z) — 1+ $27. we have g’(x) = —sin(z) + = —f(z). 
Therefore g'(x) > 0 for x > 0 and g'(x) < 0 for x < 0. It follows that g is weakly increasing 
for x > 0 and g is weakly decreasing for x < 0. Since g(0) = 0 we have g(x) > 0 for all 
GE Tk: 


EXAMPLE 2.7. Suppose that 71, %2,...,2%, are real numbers such that 0 < x; < 1 for all 
i. What is the maximum possible value of 


PROOF. We want to maximalize the function 


n n 


J Sishoieqhe) = ys So (zi — re 


i=1 j=l 


(With some analysis one can see that f must have a maximum value, because f is a con- 
tinuous function on a compact set. Don’t worry if you do not understand this. Perhaps we 
will discuss it later, but we will not use it now.) Let us fix x2, 73,...,%p, and consider f as 
a function of one variable 7,. Say f = ax7+ bx, +c where a = n? > 0 and b,c are constants 
depending on %2,%3,...,2%. Now f could have a local extremum, but this would always be 
a local minimum because a > 0. The maximum of f is therefore at x; = 0 or at 7; = 1. 

From this discussion it is clear that we can replace x, by 0 or by 1 without decreasing 
the value of f(x1,%2,...,2n). Similary, we can replace x2 by 0 or by 1 without decreasing 
the value of f etc. So 


Fis Poss 3) < Ff (Uistig nasa) 


for some choices y1, y2,---,;Yn € {0, 1}. 
So we are looking for the maximum value of 


F (Yt, Yor ++ +5 Yn) 
where 1, Y2,---,Yn € {0,1}. By symmetry we may assume that y; = yo =--- = yy =0 and 
Ye+i = Yet2 = +++ = Yn = 1. In that case, the value of f(y,..-,Yn) isk(n—k) + (n—k)k = 
2k(n —k). The function 2k(n — k) is again a parabola with the maximum at k = }. But k 
has to be an integer. It follows that the maximum value of f(21,%2,...,%n) is 
2 
ae 7 - _7n 
2 2 2 
if n is even and 
n-1 n—-1 n?—1 
2 ~ = 
ie mre eee) 


if n is odd. 
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Exercises 


EXERCISE 2.1. *** There are n people standing in a field, each carrying a gun. Every 
person shoots the person nearest to him (all people shoot at the same time, all distances are 
distinct). Show that at least one person survives if n is odd. 


EXERCISE 2.2. ** Let f : Z— Z be an integer-valued function on Z with 
2f(n) < fn—1)+f(n+1) 


for all n € Z. Prove that f has arbitrary large values. 


EXERCISE 2.3. **** Let S be a measurable subset of R? with area A. Show that one 
can choose a set TC S of at least A/a points in S such that all pairs of distinct points in 
T have distance at least 1. (Hint: Suppose that S is maximal. Consider balls of radius 1 
around each point. What can you say now?) 


The following result is actually useful in coding theory and is known as the Gilbert- 
Varshamov bound. 


EXERCISE 2.4. **** Let S = {0,1}”" (this means that S is the set of all sequences with 
just zeroes and ones of length n). For a = (aj, d2,...,@n) and b = (b),b2,...,bn) we define 
the Hamming distance by 

d(a,b) = #{i | a: £53} 


i.e., the number of indices for which a; 4 b;. Prove that for every positive integer k there 
exists a subset T' of S with at least 
PAG 
() +) + G) +--+ G2) 
elements such that d(a,b) > k for all distinct a,b € T. (The solution to this problem is 
similar to the solution of Problem 2.3.) 


EXERCISE 2.5. *** Suppose that aj, a2,...,@n, 01, b2,...,b, are real numbers such that 
ay < dg < +++ < a, and b) < bo < +--+ < Dy. Let c,c2,...,c, be a permutation of 
b,, bo,...,6,. Show that 

ab, + dgby_1 Sea And < a,Cy + AgCo + +++ + AnCy < a0 1 + a1 b free $f An Dn: 


(Hint: Choose a permutation c,,...,Cpj such that ajc, + +++ + nC, is maximal. Prove that 
Cy < a < Cpe) 


EXERCISE 2.6. *** Suppose that a, < ag <-+-+- <a, and by < by <--- < by. Prove the 
following Chebychev inequality: 


Ay +agt---+Gn by tbgt---+bn < a0, + dobg +++: + andy 


n n n 
(Hint: Use the previous problem.) 


EXERCISE 2.7. ** For every integer n, prove that there exists a subset S of {1,2,...,n7} 
with n elements ,so that the difference of any two distinct elements of S is not a square. 


EXERCISE 2.8. ******* Suppose that there are n lines in the Euclidean plane R? such 
that 


16 


(a) Every two lines intersect; 

(b) Through any intersection point of two lines there goes at least one other line. 
Prove that all lines go through one point. (Hint: Choose a line @ and an intersection point 
P, not on £, such that the distance of P to ¢ is minimal. Deduce a contradiction.) 


EXERCISE 2.9. ***** Suppose that in the plane R?, there are n blue points and n red 
points (all of them distinct). No four points are on a line. Show that you can label the blue 
points with B,, Bo,...,B, and the red points with R,, Ro,..., R, such that the line segments 
B;R; do not intersect each other. (Hint: Choose a labeling with )>;"_, |B;R;| minimal, where 
|B;R;| is the distance from B; to R;.) 


EXERCISE 2.10. *** Suppose that f is a differentiable function on the interval [0, 2). 
Prove that there exists an element x € [0,2] with 


f"(@) = f() — 2f(1) + F(2). 


EXERCISE 2.11. ** Find the maximum value of 


1 


Le 
for x > 0. 
EXERCISE 2.12. * Show that 
e >axrt+l1 
for allz ER. 
EXERCISE 2.13. * Show that 
log(x) <a-1 


for all x > 0 (here log(z) is the natural logarithm). 


EXERCISE 2.14. ** Suppose that f is a differentiable function on [0, oo) such that f(0) = 
land f’(x) > f(x) for all z > 0. Prove that f(x) > e® for all x > 0. 


EXERCISE 2.15. ** Suppose that f(x) is a continuous function on R which can be dif- 
ferentiated twice, and f”(x) > 0 for all 2 € R. Show that f(x) is positive for some x € R. 


EXERCISE 2.16 (Putnam 1985). *** Let T be an acute triangle. Inscribe a rectangle R 
in T with one side along a side of JT. Then inscribe a rectangle S in the triangle formed by 
the side of R opposite the side of the boundary of T and the two othert sides of 7’, with 
one side along the side of R. For any polygon X, let A(X) denote the area of X. Find the 
maximum value, or show that no maximum exists, of 

A(R) + A(S) 
A(T) 
where 7 ranges over all trinagles and R,S over all rectangles as above. 
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CHAPTER 3 


Inequalities 


1. Elementary inequalities 


Perhaps the most fundamental inequality for real numbers is 


x? > 0, ceER. 


Using this inequality one can deduce many more inequalities. For example, if we take 
x =a-—b0b with a,b € R we obtain: 


a® — 2ab+ b? = (a—b)? > 0. 


It follows that 
Py 2 
a“ +6 = 
= 
This inequality is interesting by itself. If we now substitute a = \/y and b = \/z we obtain 


ab. 


Ye 


ae 2 V2. 


whenever y, z are nonnegative real numbers. Substitution is a very useful method for proving 
inequalities. 


EXAMPLE 3.1. Prove that 


ae ee > ab-ae-e be 


for all a,b,c € R. Also prove that equality holds if and only if a= b=c. 


PROOF. We have 
a? +b? +c — ab —ac— be = $((a — 6)? + (b—c)? + (c—a)”) > 0 


and it is now obvious that equality holds if and only ifa=b=c=0. 


Another obvious but important inequality is: 


zy>0, ifv,yeRandzx>0O0andy>0. 


This can be used in many ways. For example if 0 < x < 1 then 
xL> x 


because 
g—¢° =a —2)>0 
and both x and 1 — x are nonnegative. 
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EXAMPLE 3.2. Suppose that 71, %2,...,2, are real numbers such that 0 < x; < 1 for all 
7. Prove that 


Cy Dy PP Dy Bis oP Gots poeta +e sb Bali: 
When do we have equality? 
PROOF. The inequality is obvious because it is equivalent to 
v1(1 — 22) + vo(1 — &3) +--+ 2, (1 -— 21) > 0. 
If we have equality then 


g=Oorg,=1, ew =Oorrvg=1,... ,#, =Oorz,=1. 
If x; # 0 then x2 = 1 and in particular x2 # 0. From this it follows that 73 = 1. But then 
x3 #0, so x4 = 1, etc. This way we see that rg = 73 = 14 Ln = 21 = 1. Ina similar 
way we see that if x; # 0 for some i, then 2, = ry = 13 = +--+ = Ly, = 1. The only other case 
where equality holds is when x, = rg = --- = Lp, = 0. 


Making the right substitutions can be very helpful as the following example shows. 


EXAMPLE 3.3. Suppose that a,,a2,...,@pn are real numbers such that a; > 1 for all 2. 
Prove the inequality 


Tm 


(1+ a1)(1 +42) ---A + @n) 2 


Let us write a; = x; +1. Then x; > 0 for all 7. It is easier to deal with the inequality 
x; > 0 than with the inequality a; > 1. The inequality transforms to 


(l+a,+a.+---+a,). 


Jie 
(2+ 21)(24%2)-(2+a,) 2 Ge Paige ae te te (ree) 
gn 
= 94 mare po f---4 Laie 


This inequality follows already if we only look at the constant and linear part of the left- 
handside: 


gn 
(2+ x1)(24+ x2) ---(24+ ay) > 2" +2" (xy + eo 4+-+++2,) > 2” 4 mares bees +n). 
because ; 
gual S gn-l : 
n+l n+1 


2. Convexity 


Let f be a real-valued function on an interval J C R. Now f is said to be convex if 


f(ta+ (1—t)b) <tf(a) + (1 —t)f (0) 
for all t € [0,1] and all a,b € J (the chord between (a, f(a)) and (0, f(b)) lies above the 
graph of f). The function f is said to be concave if 

f(ta + (1 —t)b) > tf(a) + (1 — t) f(b) 
for all t € [0,1] and all a,b € J (the chord between (a, f(a)) and (b, f(b)) lies below the 
graph of f). 


20 


| at 


concave convex 
(You may well be used to a different terminology, for example “concave up” and “concave 
down” instead of “convex” and “concave” .) 


THEOREM 3.4. Suppose that f is a real-valued function on I CR, %,%2,...,% € I, 
and t1,to,...,t, € [0,1] witht) +te+---+t, =1. If f is convex, then 
(22) Fiver + tote +++ + tnttn) S tf (81) + tof (ta) +> + taf (Gn). 
If f is concave, then 
(23) J Gity + tole Fo + inky) St f(81) + tof (eo) + 0 + taf (Zn). 

PROOF. Suppose that f is convex. We will prove the statement by induction on n, the 
case n = | being trivial. Suppose that we already have proven that 

f(tiv1 + too free Hf th a;) < tif (x1) + tof (x2) tere t bat (ea) 


for all 41, %2,...,2%, € I and all t), to,...,t, € [0,1] witht; +t2+---+¢#,=1. 

Suppose now that 21, %2,...,2n4i € J and ty,...,tnyi € [0,1] with t1+to+---+try =1. 
Define s; = t;/(1 — tn41) for i = 1,2,...,n. Note that 5; + sg +---+ 5, = 1. Take 
G = 8101, + 89%o +--+ + Sn Xp, b= Fyy1 and t = 1 — ty41. From the definition of convexity 
and the induction hypothesis follows that 


F(tizi +--+ + trtifngi) = f(tat+ (1 —t)b) <tf(a) + (1 —t)f(d) = 
= (1 — thoi) f(s1t1 +--+ + Sn0n) + triif(tn41) < 
< (1 — trys) (sf (21) + S2f(@2) + +++ + Snf(Gn)) + trtif(Sn41) = 
= tf (21) +--+ + tr4if(en41)- 


To prove the second statement, observe that f is concave if and only if —f is convex. 
Then apply the first statement to —f. 


In particular the case t) = tg = --- =t, = 1/n is interesting. 


COROLLARY 3.5. If f is conver on I, then 


f( 


ee) 
n 


forallay,.t a, El. 
If f 1s concave on I, then 


f 


n i n 
forall 275242046 T- 
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THEOREM 3.6. Suppose that f is a real-valued function on an interval I C R with a 
second derivative. If f’(x) > 0 for alla € I, then f is convex. If f’(x) <0 for all x € T, 
then f is concave. (The converse of these statements are also true). 


ProoF. If f”’(x) > 0 for all x € J then f’(x) is weakly increasing on the interval J. 
Suppose that a,b € I and t € [0,1]. Define c = ta+ (1 —t)b. By the Mean Value Theorem, 
there exist a € (a,c) and 6 € (c,b) such that 


f(a) = fO=$O ong pry — fO=FO. 


CcC—a b-c¢ 
Since a < and ‘ is weakly increasin , we have 
g 


f(ta+ (1 —#)b) — F(a) _ Fle) — F(a) 


(1 —2)(b— a) ie ee ra Os 
< f’(8) = MO) Te) _ f= fest — t)b) 


Multiplying out gives 
f(ta+ (1—t)b) <tf(a) + (1—- tf). 
This shows that f is convex. 
The second statement follows from the first statement, applied to —f. 


EXAMPLE 3.7. Suppose that a, @,y are the angles of a triangle. Prove that 


sin(a) + sin(Z) + sin(y) < af 


Proor. The function sin(x) is concave on the interval [0,7], because its second derivative 
is —sin(x) < 0. Thus we have 
sin(@) + sin(3) + sin(y7) 
3 


(ctery) = sin(47) = va 


< sin a 


3. Arithmetics, Geometric and Harmonic mean 


THEOREM 3.8. Let 41, %2,%3,...,%n > 0. We define the Arithmetic Mean by 
Lyra ts+ + Ly 
AG ti aS 2 2 ; 
n 


the Geometric Mean by 
Gy oss Dy) SA Eo A 


and the Harmonic Mean by 


ED Wi Toye Te) aes 


Then we have 


ProoF. Let f(x) = log(x). Then f”(x) = —1/x? < 0 for x > 0 so f is concave on the 
interval (0,00). It follows that 


Bi ode l l aha] e 
log (“2 * tay 5, log(ey) + og (a2) + + log(@n) 


n 
Applying the exponential function (which is an increasing function) to both sides yields 


oe ae a a ae 
: i ES pigess oL. 
n 


If we now take y; = = then we get 


te 1 
Y1 | Y2 | | Yn 1 
n VYi1y2° °° Yn 
Taking the reciprocal yields 
n n 
ae ie a ae 
yo ye | "Yn 


Ly XQ In-1 


HD) X3 Xn Ly 


Proor. Put y; = x;/2;41 for all 7. We assume that the index is cyclic, so that 2,41; = 21. 
Comparing the arithmetic and geometric average gives: 


ya thes a 
n 


Jyiy2°°° Yn = 1. 


4. The Schwarz Inequality 


Another important inequality is the Schwarz inequality. For vectors x = (#1,...,@,) and 
y = (Y1,---;Yn) in R” one defines 


LeY=UMY +++ + ILUpYn- 


Note that z-y = y-a, (e+y)-z=ax-z+y-z and (tx)-y=t(ax-y) fort € Rand z,y,z € R”. 
The norm of the vector x is defined by 


Ie] =Ve-r= Vaart +25. 


THEOREM 3.10. Suppose that x = (21, 2a, 


--5 Xn), Y = (Yi, Ya,---5Yn) € R”, then 


Piast eyya| faite tatyyete te 


or in short form: 


Iz yl < llellyl- 
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PRooF. For any vector a-a > 0. In particular, if we take a = x + ty we get 
(x +ty)-(u+ty)=2-2+ 2t(x-y)+@y-y) 20 


for all t > 0. Viewed as a quadratic polynomial in t, this polynomial has a nonpositive 
discriminant. The discriminant is 


A(x -y)’ —4(@-2)(y-y) <0 
In particular we have 
(x+y) S$ («-2)(y-y) 
and taking square roots gives us 


Izy) < Va-a/y-y = |[xllllyll. 


The Schwarz inequality is important in Euclidean geometry in dimension 2,3 or higher. 
In particular, one often defines the angle ¢ between two vectors x,y by 
cos(y) = a, Oo ar. 
Iz |Iy| 
The Schwarz inequality tells us that this definition makes sense, since the righthand side has 
absolute value at most 1. 


5. The triangle inequality 
Another famous geometric inequality is the triangle inequality. If a,b,c are the lengths 
of the sides of a triangle, then a+b > c (and alsoa+c>bandb+c>a). 
6. One more useful inequality 


THEOREM 3.11. Suppose that 21,%2,...,%n,Y1,Y2;--+5Yn are real numbers such that 
Ly < Lg S++ KS Xp and yy < yo < +--+ < Yn. Suppose that 21, z2,...,2n are the same 
aS Y1,Y2,;+++,Yn, but possibly in a different order. Then we have 


LY PPI te en S Die oko Pe Den. Se Lit Pode Et aa: 


PROOF. Suppose that 21, 22,...,2%, is a rearrangement of yj, Yo,..-,Yn. Let m be the 
number of displacements of the sequence 21, 22,...,2n, $0 m is the number of pairs (i, 7) 
with 7 < j and z; > z;. We prove the right inequality by induction on m. If m = 0 then 
z; = y; for all 2 and we have inequality. Suppose m > 0. Then z; > 24; for some 7. Note 
that the sequence 

215 F250 0 6 Si—1y Mi41s iy M42) +++ On 
(exchange z; and 2,41) has only m — 1 displacements, so by induction 


G21. 0320 Leia + Dae oP Late 2S iy 1 eaye +e + nen: 
We have 
(Cina — 2i)(i — 241) > 0, 
SO 
Lee + Lip. > Liar + Vi41% 
24 


and 


124+ %Q%+-°-4 


+ X42; 4 
> 4424 t UZ ++°+4 


UVi41Zi41 +++ 2nZn 4 
LiZin1 + 41% +++ + Laz 2 Lyi + Leye 


2 rett + LnYn- 
The left inequality in the Theorem follows from the right inequality. Note that —y, < 
SU =U, and that 245 25, 
we have 


..,—Zn iS a rearrangement of —y), —Yyo,...,—Yn-. SO 


@1(—2%) + B2(—22) + +++ + Bn(—%n) S F1(—Yn) + F2(—-Yn—-1) + +++ + En(—-M1). 


7. Exercises 
EXERCISE 3.1. ** Use the inequality at > ,/ry repeatedly to prove 
e+ytz+w 


Z > 4/xyzw 
for allie,y. 2,10 > 0. 


EXERCISE 3.2. ** Prove that 


2 
2 2 2 
Orc sae 2 a ye LjX j 
l<i<j<n 
for all positive integers n. 


EXERCISE 3.3. * Ifa <y< z and y > 0, prove that 
eg 
Seay 


y 
EXERCISE 3.4. ** For nonnegative real w1, 


...,Un, prove that 


n n 
Su? sot eh 
i=l i=l 
(use that x? is convex for x > 0). 


EXERCISE 3.5. *** Suppose that pj, po, 


,Pn are nonnegative real numbers such that 
4 p; = 1. Prove that 


n 


Se —pi log p; < logn. 
i=1 
(This inequality comes from information theory.) 


EXERCISE 3.6. ** For positive real a,b,c prove that 


Pe tea +e°h > 307i. 
EXERCISE 3.7. *** Let 


pgs 
Prove that 


(Hint: use the geometric and arithmetic mean for 1 + 1,1 + s, vee D4 4 and for 1 — s, 1- 

--,1-2)) 

EXERCISE 3.8. **** Prove the Hélder inequality: If 1/p+1/q = 1 and x,y € R” then 
Iz- yl < [xIlpllylle 


where ||x||, = (\ai|? + |x|? +--+ + |ap|?)/”. (Hint: Use that log(x) is convex and prove 
iyi < 2?/p+ y}/q. Then prove the inequality in the special case that ||z||, = |ly||, = 1. 


1 
ah 


Reduce the general case to this special case.) 


EXERCISE 3.9. * Let Q be a convex quadrilateral (i.e., the diagonals lie inside the figure). 
Let S be the sum of the lengths of the diagonals and let P be the perimeter. Prove 


1 
yee 


EXERCISE 3.10. ** Suppose that we have an triangle with sides a, b,c such that for every 
positive integer n there exists a triangle with sides a”, b” and c”. Prove that the triangle 
must be equilateral. 


EXERCISE 3.11. ** Suppose that 21, 22,...,Zp are positive real numbers. Prove that 
2 2 2 2 
oe Ga. Ee 
So ee ee ee ee 
tT 843 Ln v1 


EXERCISE 3.12. *** Prove that 
atb’c& > abbc* 
for all positive real numbers a, }, c. 


EXERCISE 3.13. * Prove that 


Uy v2 Ln-1 x 
bees 4 “_>1 
T+Xq 2+ 2X3 In-1 + Xp Int 2X1 


EXERCISE 3.14. **** Prove or disprove: If x and y are real numbers with y > 0 and 
y(y +1) < (¢ +1), then y(y— 1) < 2”. 
EXERCISE 3.15. **** Let a,b,c be positive real numbers such that abc = 1. Prove that 
se Hl yt ol De ee a 
~ @(b+c) Beta) B(atb) ~ 2 
EXERCISE 3.16. **** Let py, po,...,Pn be any n points on the sphere 
{(2,y,z) |e +y? +2? =1}. 


Prove that the sum of the squares of the distances between them is at most n?. 


EXERCISE 3.17 (USSR Mathematics Olympiad). ****** Suppose that x1, 22,...,%n are 
positive real numbers. Prove that 


XY 4) Ln n 
| bee > 


Tg + 23 3 +X4 t+%°” 4 


(indices go cyclic). 
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CHAPTER 4 


Number Theory 


1. The greatest common divisor 


If d and n are integers, then we say that d divides n if and only if there exists an integer 
q such that n = qd. Notice that if d divides n and m, then d also divides n +m and n—m 
and an + bm for all integers a, b. 


THEOREM 4.1. [fn and m are integers and m # 0, then there exists an integer q such 
that 
n=qn+r 


where0 <r < |ml|. 


PRrooF. First reduce to the case that n > 0. Then use induction on n. (Exercise.) 
A useful variation is the following result. 


THEOREM 4.2. If n and m are integers and m # 0, then there exists an integer q such 
that 


n=qn+r 
where —|m|/2 <r < |m|/2. 


Suppose that n and m are integers, not both equal to 0. The greatest common divisor 
of n and m is the largest positive integer d such that d divides both n and m. We denote 
the greatest common divisor of n and m by ged(n,m). We will also use the convention that 
gcd(0,0) = 0. The following observation will be useful. 


LEMMA 4.3. Ifn,m,q € Z, then We have gcd(n,m) = ged(m,n — qm). 


PRooF. The case n = m = 0 is clear. Assume that at least one of n and m is nonzero. 
Now gced(n,m) divides n, m and n — qm, hence 


gcd(n,m) < gcd(m,n — qm). 
On the other hand, ged(m,n — qm) divides m, n — gm and n = (n — gm) + qm. so 


gcd(m,n — qm) < gced(n,m). 


EXAMPLE 4.4. Suppose we have two large integers, say 9081 and 3270. How do we find 
their greatest common divisor? We do division with remainder: 


(24) 9081 = 3 - 3270 — 729. 
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Now gced(9081, 3270) = gced(3270, 729). We have simplified our problem! Now we can play 
this game again: 


(25) 3270 = 4-729 + 354. 
so gcd(3270, 729) = ged(729, 354). 

(26) 729 = 2- 354+ 21 
so gcd(729, 354) = ged(354, 21). 

(27) 354 =17-21=3 


so gcd(354, 21) = gced(21, 3) = 3 because 21 = 7-3. We have found that gcd(9081, 3270) = 3. 
This method of computing the gcd is called Euclid’s algorithm. 


ALGORITHM 4.5 (Euclid’s Algorithm). Suppose that r9,r; € Z are nonzero integers. 
Define q@,q2,::: € Z and ro,7r3,--- € Z inductively as follows. If r;_, and r; are already 
defined, then we define q; and r;,, by 

Tina = GTi + Tig 
where 0 < rizi < |ri| as in Theorem 4.1 (or —|r;|/2 < riga < |ri|/2 as in Theorem 4.2). 
(Note rj41 and q are well defined as long as r; # 0). 

Since |r| > |r2| > --- we have that r,4; = 0 for some positive integer k. Suppose that 
rp41 = 0 and that r; 4 0 for i < k. Then ged(ro, 11) = rp. 

If one uses Theorem 4.2 then it is clear that Euclid’s algorithm is very fast. The number 


of divisions with remainder one has to do is roughly log, |r;|. (A similar bound also holds if 
one uses Theorem 4.1 though). 


ProoF. It is not hard to show that Euclid’s algorithm works. Note that gced(rj;_1,7r;) = 
gcd(rj,ri41) for all 7. By induction on 7 one shows that gced(ro,7r1) = ged(ri, ri41) for all 7. 
In particular, 

ecd(ro, 71) = eed (ret pai) = eed(r4,, 0) =Te 


EXAMPLE 4.6. Euclid’s algorithm can be used to find integers x and y such that gcd(n, m) = 
zn+ym. It works as follows. First use Euclid’s algorithm to compute gcd(n,m) and then 
work back. For example, we already saw that gcd(9081, 3270) = 3. We would like to find x 
and y such that 


3 = 9081a + 3270y. 
Now we work back in the computation of gcd(9081, 3270). From (27) follows that 


3 = (-1)-3544 17-21. 
From (26) follows that 
3 = (-1) -354+17- (729 — 2-354) = 17-729 + (—1 —17- 2) - 354 = 17- 729 — 35- 354. 
From (25) follows that 
3 = 17-729 — 35 - (3270 — 4-729) = 
= —35 - 3270+ (17+ 35-4) - 729 = —35 - 3270 + 157 - 729. 
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and finally, from (24) follows that 
3 = —35 - 3270 + 157 - (—9081 + 3 - 3270) = 
= —157 - 9081 + (—35 + 3- 157) - 3270 = —157 - 9081 + 486 - 3270. 
This method is sometimes called the extended Euclid’s algorithm. 


ALGORITHM 4.7. (Extended Euclid’s Algorithm) Let ro,71,..-, 17%, 7 r+1 = 0 and qo, M1, ---; Ue 
as in Algorithm 4.5. 
Define x, = 0 and x4; = 1. For 7 > 1, define x,_; by induction by 


Le-i-1 = Ve-it1 — Wk-iXk-i- 


Then we have that 
ged(ro, 71) = Vigiri + BT ig 
for ¢= 0. Vk. In particular, 
ged(ro, 71) = £179 + ort. 
PROOF. Let i = k — 7. We prove by induction on 7 that 


ged(ro; Ti): =e eG 


For 7 = 0 we have to prove that 
gcd(ro,71) =1-ry +0- Te = 1p 


but this follows from Algorithm 4.5. 
Suppose that 


ged (75, 7i) = teases PRS 


If we substitute, r~_j41 = Tk-j-1 — Ue-jTk-j We obtain 


gcd(ro, 71) = Lx j+i"k—-j + Xr aT j—-1 — Gk—-jlk = 


Sp ae De AO ke See a eS 


THEOREM 4.8. Suppose that n and m are integers. There exist integers x and y such 
that 
gcd(n,m) = xn + ym. 


ProoF. This follows from the extended Euclid’s Algorithm (Algorithm 4.7). 


If n and m are integers with gcd(n, m) = 1 then we say that m and n are relatively prime. 


COROLLARY 4.9. If a,n,m are integers and a divides both n and m then a also divides 
gcd(n,m). 


PrRooF. This follows immediately from the previous Theorem 4.8. 


EXAMPLE 4.10 (Putnam 2000). Prove that the expression 
gcd(m,n) (n 
n m 
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is an integer. 


Given our previous discussion, it seems natural to write 
gcd(m,n) = am + yn 
where x,y are integers. The solution then follows quickly. 


PROOF. We can write 
gcd(m,n) = xm + yn 
where z,y € Z. Then 


is an integer. It follows that 


= n) (") 


LEMMA 4.11. Suppose that n,m,r are integers such that gcd(m,n) = 1. If m divides nr 
then m divides r. 


is an integer. 


PROOF. We can write 1 = xm + yn for certain x,y € Z. Clearly m divides xmr and 
ynr, so m divides 
r=r-l=r-(am+yn) =x2mr- ynr. 


If n and m are nonzero integers then the least common multiple lem(m,n) of n and m 
is defined as the smallest positive integer that is divisible by both m and n. Moreover we 
define lem(m, 0) = 0 for all integers m. 


LEMMA 4.12. Ifn and m are positive integers and a is an integer divisible by both n and 
m, then a is divible by lem(m,n). 


PrRooF. We can write a = qlem(m,n) + r with 0 < r < Icm(m,n). NOw r is also 
divisible by m and n. We must have that r = 0 otherwise we get a contradiction with the 
definition of lem(m,n). This proves that a is divisible by lem(m, n). 


LEMMA 4.13. [fn and m are positive integers then 
gcd(m, n) lem(m,n) = mn 


Proor. The positive integer mn/ gcd(m,1n) is divisible by m and n, so 
eee ee eh 

——~ > lem(m,n). 

ged(m,n) ~ 


The positive integer mn/lcm(m,n) divides both m and n.So we have 
mn 


lem(m,n) — ee) 
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EXAMPLE 4.14. In the strange country Oz, the only official coins are the 7-cents coin 
and the 13-cents coin. What is the largest amount that cannot be paid with these coins if a 
shop has no change at all? 


We can first make a list of amounts that we can make: 


7, 13, 14, 20, 21, 26, 27, 28, 33, 34, 35, 39, 40, 41, 42, 46, 48, 48, 49, 52, 53, 54, 55, 56, 


59, 60, 61, 62, 63, 65, 66, 67, 68, 69, 70, 72, 73, 74, 75, 76, 77, 78, 79,... 


The largest amount that cannot be made seems to be 71. It is easy to check that 71 is not 
a nonnegative combination of 7 and 13. 

Let us try to understand why all the amounts of > 72 can be made by using the two 
types of coins. Suppose that m > 72. We would like to find nonnegative integers x7 and y 
such that 


m = 13a + Ty 
If x and y are not necessarily nonnegative then this is certainly possible. We can write 
Lea 
and 
m = 13-(—m)+7- 2m. 
Using this solution we find many other solutions. If k is an integer then 
m =13-(—m+7k)+7- (2m — 18k). 


For a suitable k, we might have that both —m+ 7k and 2m— 13k are nonnegative. Consider 
the smallest k for which —m + 7k is nonnegative. Then we have 0 < —m+ 7k < 7. Now 


7+ (2m — 13k) > m—13-(—m+7k) > 72 —13-6 =—6. 


It follows that 2m — 13k > —1, so 2m — 13k is nonnegative. 
Now we would like to prove that if 


(28) 71 = 130 + Ty 


with x,y € Z, then at least one of the integers x, y is negative. We already saw one solution, 
namely 


(29) 71 =13-(—71) +7- (142). 
Combining (28) and (29) yields: 
13 - (a +71) =7- (142 — y). 


Since gcd(13,7) = 1, we have that 7 divides x + 71. Say «+71 = 7k. Then we alsy 
have that 142 — y = 13k. From x+ 71 = 7k and x > 0 follows that k > 11. But then 
y = 142 — 13k < —1. 
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2. Prime Numbers 


Recall that a prime number is a positive integer with exactly 2 positive divisors, namely 
1 and itself. 


LEMMA 4.15. If p is a prime number dividing mn where m and n are integers, then p 
divides m or p divides n. 


ProoF. If p divides m then were are done. Otherwise gcd(p,m) = 1, so p divides n by 
Lemma 4.11. 


THEOREM 4.16. (Euclid) Every positive integer can uniquely be written as a product of 
prime numbers. 


PROOF. We already have seen in problem set 1 that every positive integer is a product 
of prime numbers. We still have to prove the uniqueness. Suppose that py < po < +--+ < pr 
are distinct prime numbers such that 


DD, ene Det = pepe ons Se 
Suppose that a; > b;. Then 

pi pp ope Spy en, 
and p, divides pe ---pr*, This implies that p; divides p; for some i > 2. This is impossible 
because the only divisors of p; are 1 and p, itself. Constradiction. Therefore a, < 0. 
Similarly we see that a, > 6; and therefore a; = b,. Similarly we can prove that a; = 0; for 
i a eee oe 


Many properties can be expressed in terms of the prime factorization. Suppose that 
A= pi'py ++ py 
and 
B= ppp --- pt 
are positive integers with their factorizations into prime numbers. Then A divides B if and 
only if a; < b; for all? = 1,2,...,k. We also have that 


ged(A, B) = pip? --- py 
where c; = min(q;, };) for all 7. 
THEOREM 4.17. (Euclid) There are infinitely many prime numbers. 


PROOF. Suppose that there are finitely many distinct prime numbers, say pj, p2,.-.., Dr: 
Let N = pipo::- pp +1. Now N is a product of prime numbers. In particular there exists 
a prime numper qg such that q divides N. Now q = p; for some i with 1 <i <k. Since p; 
divides N, we have that p; divides 


N — pipo+++ pp = 1 


which leads to a contradiction. 


EXAMPLE 4.18. The number v2 is irrational. 
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PROOF. Suppose taht 2 is rational, say /2 = p/q where p,q are positive integers. 
Squaring gives p? = 2q?. The prime number 2 appears an even number of time in the prime 
factorization of p?. On the other hand, 2 appears an odd number of times in the prime 
factorization of 2q?. Wet get a contradiction, so /2 has to be irrational. 


The distribution of prime numbers is quite mysterious. One of the most famous /notorious 
open problems in mathematics is the Riemann Hypothesis. This conjecture is closely related 
to the distribution of prime numbers. The following result is well-known but not so elemen- 
tary: 


THEOREM 4.19 (Dirichlet). Suppose that a and b are positive integers that are relatively 
prime. Then there are infinitely prime numbers of the form a+ nb where n is a positive 
anteger. 


3. Exercises 


EXERCISE 4.1. ** Let Fo, Fy, Fo,... be the Fibonacci numbers: Fo = F; = 1 and Fy4, = 
F,+F,_1 for alln > 1. Prove that the greatest common divisor of two consecutive Fibonnaci 
numbers is always equal to 1. 


EXERCISE 4.2. *** Define Tp, 7),T2,... by T; = 2 and T,4; = T? —T, +1 for n > 0. 
Prove that gcd(J;,7;) = 1 for all i F j. 


EXERCISE 4.3. **** Let m > 2 be an integer and suppose that a and b are positive 
integers. Prove that 
ged(m* — 1,m? — 1) = med) _ 1, 


EXERCISE 4.4. ** We have two drinking glasses. One glass can contain exactly 21loz. 
The other glass can contain exactly 130z. Is it possible to measure exactly loz. of water 
using the two glasses? 


EXERCISE 4.5. ** Suppose that 
A i Bi De. eos pis 
is an integer, pj < po <--- < px, are distinct primes and aj,...,a, are nonnegative integers. 


Give a formula for the number of divisors of A. 


EXERCISE 4.6. *** We start with a deck of 52 cards. We put all the cards in one row, 
face down. In the first round we turn all the cards around. In the second round we turn 
every second card around. In the third round we turn every third card around. We keep 
doing this until we complete round 52. Which cards will be faced up in the end? 


EXERCISE 4.7. *** Suppose that P(x) = a,x” + dn_yv”' + +++ + a9 is a nonconstant 
polynomial with integer coefficients. Prove that there exists a positive integer m such that 
|P(m)| is not a prime number. 


EXERCISE 4.8. ** Show that there are infinitely many prime numbers of the form 4k — 1 
where & is a positive integer (without using Dirichlet’s Theorem). 


EXERCISE 4.9. * Find all prime numbers p for which p+ 2 and p+ 4 are also prime 
numbers. 
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EXERCISE 4.10. **** Find all integers n for which ¢(n) divides n. 
EXERCISE 4.11. * Use Euclid’s Algorithm to find ged(1029, 791). 
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CHAPTER 5 


Modular Arithmetic 


1. The Chinese Remainder Theorem 


Suppose that m is an integer. We say that a is congruent to b modulo m if m divides 
the difference a — b. The notation we use is 


a = b(mod m). 
Notice that if a; = bi(mod m) and az = b2(mod m) then 
a, + a2 = b; + bo(mod m) 


and 
a1A2 = b,b2(mod m). 


THEOREM 5.1 (Chinese Remainder Theorem). Suppose that m and n are positive integers 
such that gcd(m,n) = 1. Suppose that a and b are integers. Then there is a unique integer 
c with 0O<ce< mn such that 

c =a(mod m) 
and 
c = b(mod n) 


PROOF. We find can find integers x,y € Z such that am + yn = 1. Note that xm = 
1(mod n) and yn = 1(mod m). Consider d = bam + ayn. We can write d = qmn+c with 
0<c< mn. Then 

c =d= ayn = a(mod m) 
and 

c=d= bam = b(mod n). 
This shows the existance of c. Suppose that c’ is another integer with c’ = a(modm), c= 
b(modn) and 0 < c < mn. Then c— ¢’ is divisible by m and by n. Because ged(m,n) = 1, 
c—C is divisible by mn. Since |c — c'| < mn we must have that c= c’. 


The previous theorem easily can be generalized as follows. 


THEOREM 5.2. Suppose that m,,ma,...,™m are pairwise relatively prime positive integers 
(so gcd(m;,m;) = 1 fori A j). Suppose that ay,a2,...,a,% are integers. Then there is a 
unique integer c with0<c<mym2:--mMp such that 
c = a;(mod m;,) 
fori =1,2,...,k. 
EXAMPLE 5.3. Show that the difference of two consecutive prime numbers can be arbi- 
trarily large. 
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We want to show that for every m there exists an n such that n+ 1,n+ 2,...,n +m 
are not prime. Let us assume that n+ 1 is divisible by 2 and that n > 2. Then n+ 1 is not 
a prime number. Now n+ 2 is not divisible by 2. However, we could assume that n + 2 is 
divisible by 3 and n+ 2 > 3. Then n+ 2 is certainly not a prime either. Similarly we could 
assume that n+ 3 is divisible by 5 and n +3 > 5. The Chinese Remainder Theorem comes 
to the rescue. 


PROOF. Let p1,po,.--,Pm be the first m prime numbers. Using the Chinese Remainder 
theorem we can find an integer c such that 
n = —i(modp;) 


for i = 1,2,...,m. Without loss of generality we may assume that n > p; for all 7 (otherwise 
we may add a multiple of pip2---pm ton). For every 7 in {1,2,...,m} we see that p; divides 
n+i but c+i> p;. This shows that c +7 is not a prime number. 


A slightly easier proof is the following. 
PROOF. For every n, consider the numbers 


n!+2,n!43,---,nl+n. 


all these numbers are not prime numbers because n! + 7 is divisible by 7. 


2. Euler’s function 
For an integer n we define ¢(n) as the number of elements in the set 
{ae Z|l<a<n, gced(a,n) =1} 
of all positive integers a which are relatively prime to n. 
LEMMA 5.4. Ifm and n are positive integers then ¢(mn) = o(m)¢(n). 


PROOF. Givenc € {1,2,..., mn} we can find unquea € {1,2,...,n}andb € {1,2,...,m} 
such that 


(30) c =a(mod n) 
and 
(31) c = b(mod m). 


Conversely, givena € {1,2...,n}andb € {1,2,...,m} there exists a unique c € {1,2,...,mn} 
such that (30) and (31) hold by the Chinese remainder theorem. We have 


gcd(c, mn) = ged(c,m) ged(c, n) = ged(a,n) ged(b, m). 


So ged(c,mn) = 1 if and only if ged(a,n) = gced(b,m) = 1. There are $(n) choices for a 
such that gcd(a,n) = 1. There are ¢(m) choices for b such that gcd(a,m) = 1. Therefore, 
there are ¢(n)¢(m) choices for a and b such that gcd(a,n) = ged(b,m) = 1. So there are 
@(m)(n) choices for c such that ged(c,nm) = 1. This shows that ¢(mn) = ¢(m)¢(n). 


LEMMA 5.5. If p is a prime number and k is a positive integer then 
(p") = (p— 1p. 
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PROOF. The elements of 
fie ou | 
that are not relatively prime to p* are exactly the p*—! multiples of p. This shows that 


o(p") = p* — p** = (p— 1)p*. 


In general if n = p{'p5?---p;* is the prime factorization of n, where py; < po <--+ < Dx 
are distinct prime numbers and aj, da2,..., a, are positive integers, then 
p 2 ? ? p soa ? 


b(n) = (pr — 1)pf!* (pe — 1) 9271 +++ (pe — 1)pgF*. 
3. Exercises 


EXERCISE 5.1 (Gardner, M., The Monkey and the Coconuts, Ch. 9 in The Second 
Scientific American Book of Puzzles & Diversions: A New Selection. New York: Simon and 
Schuster, pp. 104-111, 1961.). *** Five sailors survive a shipwreck and swim to a tiny island 
where there is nothing but a coconut tree and a monkey. The sailors gather all the coconuts 
and put them in a big pile under the tree. Exhausted, they agree to go to wait until the 
next morning to divide up the coconuts. 

At one o’clock in the morning, the first sailor wakes. He realizes that he can’t trust the 
others, and decides to take his share now. He divides the coconuts into five equal piles, but 
there is one left over. He gives that coconut to the monkey, buries his coconuts, and puts 
the rest of the coconuts back under the tree. 

At two o’clock, the second sailor wakes up. Not realizing that the first sailor has already 
taken his share, he too divides the coconuts up into five piles, leaving one over which he 
gives to the monkey. He then hides his share, and piles the remainder back under the tree. 

At three, four and five o’clock in the morning, the third, fourth and fifth sailors each 
wake up and carry out the same actions. 

In the morning, all the sailors wake up, and try to look innocent. No one makes a remark 
about the diminished pile of coconuts, and no one decides to be honest and admit that 
they’ve already taken their share. Instead, they divide the pile up into five piles, for the 
sixth time, and find that there is yet again one coconut left over, which they give to the 
monkey. 

How many coconuts were there originally? (Find the smallest number of coconuts that 
is consistent with this story.) 


EXERCISE 5.2. * Use the Extended Euclid’s Algorithm to find integers x,y € Z such 
that 2682 + 421ly = 1. 


EXERCISE 5.3. ** Find a multiple of 2003 that ends with the digits 9999. 
EXERCISE 5.4. ** Find an integer x such that x? + 1 is divisible by 130. 
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CHAPTER 6 


Polynomials 


1. Introduction 


In this problem set we will consider polynomials with coefficients in kK, where K is the 
real numbers R, the complex numbers C, the rational numbers Q or any other field. (A 
field is a number system satisfying certain axioms, but if you have not heard about this 
before, just think of the examples we just mentioned.) A polynomial in the variable x with 
coefficients in K is an expression 


f(x) = dpe” + Qn_iz™ | +---+a,2 + a9 


with ao, @1,@2,...,@) € K. If a, #0 then f is said to have degree n. We may denote this 
by deg(f(a)) =n. For convenience, we also define the degree of the zero polynomial 0 by 
deg(0) = —oo. The polynomial is called monic if a, = 1. Now Ka] denotes the set of all 
polynomials in the variable x with coefficients in K. 

In many ways, polynomials behave similar to Z (this is because K |x| and Z are both 
so-called principal ideal domains). As for the integers Z, we can define gcd and lcm for 
polynomials. There also exists an Euclidean algorithm To prove these results for polynomials, 
one could simply copy the proofs for the same results for the integers. 

Suppose that f(x), g(x) € Ka]. First, we say that a polynomial g divides a polynomial f 
if f(a) = g(x)h(x) for some polynomial h € Ka]. A monic polynomial f is called irreducible 
if it has exactly 2 monic divisors (namely 1 and itself). 

For example, z?+1 € R[z] is irreducible. Indeed if x? +1 is a product of two polynomials 
of degree 1, then 2? + 1 = (x + a)(x + b) and —a € R would be a zero of x? + 1 which is 
impossible. Seen as a polynomial with complex coefficients z?+1 € C[z] is reducible, namely 
x? +1=(x+i%)(x —1). The polynomial x? — 2 € Q{z] is irreducible by a similar reasoning 
because V2 is irrational. 

These monic irreducible polynomials play the role of prime numbers. For example every 
monic polynomial is a unique product of monic irreducible polynomials (as we will see). 

Sometimes we will also consider polynomials in several variables. For example K |x, y] 
denotes the polynomials in x7 and y with coefficients in K. These can be seen as polynomials 
in y with coefficients in K |x], or polynomials in x with coefficients in K[y]. 


2. Division with remainder 


All polynomials considered have coefficients in K. We will develop a theory similar to 
the theory of integers. 


THEOREM 6.1. Jf f(x), g(x) are polynomials and g(x) #4 0, then there are unique poly- 
nomials q(x) and r(x) such that 


f(x) = q(a)g(@) + r(2) 
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with deg(r(x)) < deg(g(z)). 


We will call g(x) the quotient and r(x) the remainder. Theorem 6.1 can be done explicitly 
using a long division just like you would do for integers. For example, let us divide 7° + 
32° + 22 — 1 by 2? — 2 42: 


z—x4+2/ 2° +32 +22 -1 \z342742¢ 
eo oe So 
eg +03 
go ge? Ay? 
Ig 9g? 0G 
ae ee 
—27% —-1 


Therefore the quotient is 7? + x? + 2x and the remainder is —2x — 1 (You may have learned 
the long division slightly different. For example, it is a cultural thing where you put the 
quotient. Also, the horizontal bars weren’t meant to be quite this long but I didn’t figure it 
out how to do this in TX properly.) 


THEOREM 6.2. Suppose that f(x), g(x) are nonzero polynomials, and let h(x) be a nonzero 
monic polynomial of smallest degree such that both f(x) and g(x) divide h(x). This polyno- 
mial h(x) is unique and we call it lem( f(x), g(x)). Moreover if u(x) is any common multiple 
of f(x) and g(x) then lem(f (x), g(x)) divides u(x). 


ProoF. If u(x) is a common multiple of f(a) and g(x) then we can write u(x) = 
q(x)h(x) + r(x) with deg(r(a)) < deg(h(x)) and r(x) is a common multiple of f(a) and 
g(x). Now r(x) cannot be nonzero by minimality of deg(h(x)). Therefore r(z) = 0 and 
h(a) divides u(x). We now prove uniqueness of h(x). If u(a) were another nonzero monic 
polynomial with minimal degree such that f(x) and g(x) divide v(x), then h(x) must divide 
v(x) and v(x) must divide h(x). Since both polynomials are monic, we get h(x) = v(z). 


THEOREM 6.3. If f(x), g(x) are nonzero polynomials, then there is a nonzero monic 
polynomial h(x) of largest degree such that h(x) divides both f(x) and g(x). The polynomial 
h(x) is unique and we call it gcd( f(x), g(x)). Moreover, if u(x) is any polynomial dividing 
both f(x) and g(x) then u(x) divides gcd( f(x), g(x)). 


ProorF. Define h(x) as a nonzero polynomial of smallest possible degree such that it is 


of the form 
a(x) f(x) + (x) g(x) 


for some polynomials a(x) and b(x). We may assume that h(x) is monic by multiplying with 
a constant. Using division with remainder, we find q(x) and r(x) such that 


f(x) = q(@)h(a) + r(x) 
with deg(r(x)) < deg(h(x)). Now 
r(x) = fle) — q(@)(ala) f(a) + 02) g(@)) = (1 — a(w)a(x)) f 
Because of the minimality of the degree of h(x), we must have r(x 


divides f(x). Ina similar way one can prove that h(x) divides g(x 
dividing both f(x) and g(x) then u(x) also divides h(x) = a( 
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xt) + (—q(x)0(@)) g(x). 
= (0. This shows that h(z) 
If u(x) is any polynomial 


(2). 
x)f (x) + b(x) g(x). This also 


( 
) 


shows immediately that h(x) isa common divisor of f(x) and g(x) of largest possible degree. 
If u(x) is another monic common divisor of f(x) and g(x) then u(x) divides h(x) and since 
both are monic of the same degree we get u(x) = h(x). This shows the uniqueness. 


The previous proof shows in particular that for nonzero polynomials f(x) and g(x), there 
always exists polynomials a(x) and 6(x) such that 


ged( f(x), g(a)) = a(x) f(x) + b(@)g(@). 
One could also define Ilem( f(x), 0) = lem(0, f(x)) = 0 and ged(f(x),0) = ged(0, f(x)) = 
f(x) for any polynomial f(z). 


3. Euclid’s Algorithm 


We can also compute the greatest common divisor of two nonzero polynomials f(z) 
and g(x) using the Euclidean algorithm. Let us assume that deg(f(x)) > deg(g(x)). Put 
ro(x) = f(x) and r(x) = f(x). If r;(~) 4 0 then we define r;,,(x) and q;(x) inductively by 

rj-1(£) = qi(x)ri(x) + rigi (2) 
where q;(%), Ti41(v) € K [a] and deg(rj41(x)) < deg(r;(x)). Since deg(ro(x)) > deg(ri(x)) > 
- we must have r;,4;(xz) = 0 for some k. So we have 
ro(t) = qn(x)ri(x) + r2(2) 


Ti(x) = q(x)ro(x) + 73(z) 


Te-2(@) = qu-1(X)re-1(@) + re(2) 


Tae) 2 Gel 2) Cee) 


Up to a constant r(x) is equal to gced(f(zx),g(x)). All proofs are similar to the GCD 
algorithm for integers. 
4. Chinese Remainder Theorem 
DEFINITION 6.4. If a(x), b(x), f(x) € K [x] are polynomials then we write 
a(x) = b(x) mod f(z) 
if f(a) divides the a(x) — b(z). 
We now can formulate a Chinese Remainder Theorem for polynomials. 


THEOREM 6.5 (Chinese Remainder Theorem). Jf f(x) and g(x) are polynomials with 
gcd( f(x), g(x)) = 1 and a(x), b(x) € K[a] are polynomials, then there exists a polynomial 
c(x) € Ka] with deg(c(x)) < deg(f(x)) + deg(g(x)) such that 


c(xz) = a(x) mod f(z) 


and 
c(x) = b(x) mod g(x) 
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5. Unique Factorization 


Just as for the unique factorization into prime numbers, every polynomial has a unique 
factorization into irreducible polynomials. 


THEOREM 6.6. Every monic polynomial in K[x] can be uniquely (up to permutation) 
written as a product of monic irreducible polynomials. 


6. The Fundamental Theorem of Algebra 
THEOREM 6.7 (Fundamental Theorem of Algebra). If P(z) = 2"+an_12" 1+: +++a12+a9 


is a polynomial with complex coefficients ag, a1,...,4n—1, then 
P(z) = (¢—21)(2 — 22) +++ (2 — an) 
for some complez numbers x1, %2,...,Un. Here £1,%2,...,Xn are exactly the zeroes of P(z), 


(some zeroes may appear several times, we call them multiple zeroes). 


The fundamental theorem of algebra implies that the only irreducible monic polynomials 
over the complex numbers are of the form z —a, with a € C. 


COROLLARY 6.8. If P(z) = 2" 4+ dn_12" 1 +++: +a,2 + a9 is a polynomial with real 
coefficients ag, @1,...,Qn—1, then we can write 


P(z) = Q1(z)Q2(z) +++ Q,(Z) 


where Q;(z) is a monic polynomial of degree 1 or 2 for every i. 


PROOF. It suffices to show that irreducible polynomials over R have degree 1 or 2. Sup- 
pose that P(z) is an monic irreducible noncosntant polynomial. According the the previous 
theorem, there exists a complex root a € C. Ifa is real, then P(z) is divisible by z—a and we 
must have P(z) = z — a because P(z) is irreducible. If a is not real, then P(@) = 0 as well, 
where @ is the complex conjugate of a. Let Q(z) = (z —a)(z—-@) = 27 — (a+@)z+<aa. The 
polynomial Q(z) divides P(z) and Q(z) has real coefficients. Because P(z) is irreducible, we 
have that P(z) = Q(z). 


Suppose that P(z) is a monic polynomial of degree n with zeroes 21, 2%2,--+ ,%p. Then 
we have 
P(z) = (¢—41)(2 — 2g) +++ (2 — n). 
If we multiply this out we get 


P(z) = 2" —e,z™ 1 + eg2™ 7 —--- + (—-1)"en 


Ee, = ) Lj, Lig ** Liz. 


1<01 <t2<-+*<ap<n 


where 


are called the elementary symmetric polynomials. In particular we have 
€y = M+ %o4+-++++ Uy 


which is the sum of all zeroes, 


€2 = 1X2 + XyM3 + +++ XX + Vo%3 + Log tees + LAM, +++ + UMy_14Ly. 


which is the sum of all products of distinct variables, and 
Cn = %1%2°°** Un 


which is just the product of all variables. You probably know the case n = 2. In that case 
we get P(z) = (z — 21)(z — 22) = 27 — (41 + @2)2 + 21%, 80 $1 = 1714 FQ and Sy = 11,29. 


A polynomial Q(21, 22,...,%n) in the variables 71, 22,...,2n is called symmetric if 
Q(x1, te Ei) = lem ene te ,Lo(2)) 
if o(1),0(2),...,0(n) is a permutation of 1,2,...,n. The elementary symmetric polynomials 


are of course symmetric. Other important symmetric polynomials are the power sums: 
Pe= Up +ug+--- tak. 
7. Exercises 
EXERCISE 6.1. ** Show that 
Pntk — €1Pn+b—1 + €2Pn+k—2 — +++ + (—1)"enpe = 0. 
EXERCISE 6.2. * Show that e2 = (p? — p2)/2, and e3 = (p} — 3p1p2 + 2p3)/6. 


EXERCISE 6.3. ** Suppose that P(z) = 2” + dn_12""1 +--+ + ao is a polynomial with n 
distinct real zeroes, 171, %2,--- ,%,. Express 


1 1 i 


in terms of ag, @1,...,Qn—1. 
EXERCISE 6.4. *** Suppose that 21, 22,23 are complex numbers with 
v1 + Lo + £3 =2i+a34 23 =rit+at+a2 = 10. 
What is a} + v3 +23? (Hint: Use problem 6.1 and problem 6.2.) 


EXERCISE 6.5. **** Show that we have equality of formal power series 


5 eee ee = Pie | D2 2 | P3 3 bog dea 
A~ i 1-4 9 T 3 T 


EXERCISE 6.6. *** Suppose that |z| < 1. Show that 


Wore ee =|[[a+~". 


EXERCISE 6.7. (Polya’s Theorem)***** If P(z) = z”+ay)_12" !+-+++ap is a polynomial 
with real coefficients, such that P(z) > 0 for z > 0. Prove that (1+ z)”"P(z) has nonnegative 
coefficients. (For example, 1 — 3z + 3z7 > 0 for all z > 0, and 

(14 z)8(1 — 324327) =14 10z 4 422? + 9127 + 91244 
+12872* + 4292" + 2002z2° + 20022"° + 13652"! + 637z"? + 196z"* + 362"* + 32° 


has nonnegative coefficients. By the way, 13 was the smallest power with this property here. 
Hint: Use the fundamental theorem of algebra for polynomials with real coefficients.) 


43 


EXERCISE 6.8. *** A polynomial P(z) with real coefficients of degree n starts with 
az” + bz" 1 + ez" 2 +4... 


Show that if P(z) cannot have n real zeroes if b? — 2ac < 0. Also show that there exists a 
polynomial P(z) with n real zeroes for which b? — 4ac < 0. 


EXERCISE 6.9. * A polynomial P(z) = anz"+dn_12" 1+: +++a ,2z+49 is called symmetric 
if a; = G,_; for all i. (Assume that a, 4 0.) Prove that for a symmetric polynomial P(z) we 
have that x is a zero of P(z) if and only if 1/z is a zero of P(z). 


EXERCISE 6.10. ** Find all zeroes of the (symmetric) polynomial 
P(z) = 24 + 102° + 2327 + 10z +1. 


(Hint: first prove that there exists a factorization P(z) = (27 + az + 1)(z? + bz 4 1) using 
the previous problem.) 


EXERCISE 6.11. ** 


(a) Prove Theorem 6.1 (for example by induction with respect to deg(f())). 

(b) Suppose that f(x) is a polynomial with coefficients in K and f(a) = 0 for some a € 
Kk. Prove that you can write f(x) = (x — a)q(x) for some polynomial g(x) € K {2}. 

(c) Use this to show that a nonzero polynomial of degree n has at most n zeroes. 


EXERCISE 6.12. *** Find all polynomials (with real coefficients) f(x) such that 
f(x?) = (f(a))’. 
EXERCISE 6.13. ** If f(x) is a polynomial, prove that we can write 
f(x) — Fy) = a(@, y) (a — y) 
where a(x, y) is a polynomial in two variables. (Hint: Reduce to the case f(x) = x”.) 


EXERCISE 6.14. * Suppose that f(x) is a polynomial with integer coefficients, and that 
a,b are integers. Show that f(a) — f(b) is divisible by a — b. In particular, if a= bmodn 
for some integer n, then f(a) = f(b) mod n. 


EXERCISE 6.15. *** Let P(x) be a polynomial with integer coefficients. Prove: There 
do not exist three distinct integers a,b,c such that P(a) = b, P(b) =c and P(c) =a. 


EXERCISE 6.16. *** Find a polynomial P(x, y,z,t) (with real coefficients) such that 
P(2,0,0,1) = 2001, but P(a,b,c,d) = 0 for all other quadruples of integers with 0 < 
G;0,.0-d <9: 

EXERCISE 6.17. *** 

(a) Prove the identity 


cos(naz) = 2. cos(x) cos((n — 1)x) — cos((n — 2)2) 


for natural numbers n and z € R. 

(b) Prove that for every natural number n there exists a polynomial T,,(x) of degree 
n such that T,,(cos(a”)) = cos(nx). (These polynomials T;,() are called Chebycheu 
Polynomials). 


44 


(c) Prove that |T;,,(x)| < 1 for |x| < 1 and that the leading coefficient of T;,,(x) is 2”~' 
(ies Tyee eee 


EXERCISE 6.18. ***** Suppose that P(x) = anx2" +an_1v"'++-+++a9 is a polynomial of 
degree n and |P(x)| <1 for |z| < 1. Prove that |a,,| < 2"~'. (Hint: You may use the results 
of the previous problem. If |a,| > 2”~', then show that T,(x) — 2" P(x) is a polynomial of 
degree n — 1 with at least n zeroes.) 


EXERCISE 6.19. ***** Suppose 


f(x) — fly) = a(z, y)(9(x) — g(y)) 


for some polynomials f(x) and g(x) and a polynomial a(x, y) in two variables. Prove that 
there exists a polynomial h such that f(x) = h(g(2)). 


EXERCISE 6.20 (Putnam 1986, A6). ***** Let ai, a2,...,@n be real numbers, and let 
bi, bo,..., bn be distinct positive integers. Suppose that there is a polynomial f(x) satisfying 
the identity 


Find a simple expression (not involving any sums) for f(1) in terms of 6), b2,...,b6, and n 
(but independent of a1, a2,..., dn.) 


EXERCISE 6.21 (Putnam). ***** Let f(x) be a polynomial with integer coefficients. 
Define a sequence ao, a1,... of integers such that a9 = 0 and a,41; = f(a,) for all n > 0. 
Prove that if there exists a positive integer m for which a,, = 0, then either a, = 0 or ag = 0. 


EXERCISE 6.22. *****(Gauss’ Lemma) 

(1) If f(z) = a,v"+a,_,2"~!+--+++a9 is a polynomial with integer coefficients, then the 
content c(f(x)) of f(a) is defined as gcd(ap,@n_1,.-.,@9). Prove that c(f(x)g(x)) = 
c(f(x))e(g(x)) if f(x) and g(x) are polynomials with integer coefficients. (Hint: 
Reduce to the case that c(f(x)) = c(g(x)) = 1. Then reduce the polynomials 
modulo some prime numbers and see what happens.) 

(2) If f(x) is a polynomial with integer coefficients and f(x) = a(x)b(x) with a(x) and 
b(x) nonconstant polynomials with rational coefficients, then one can find polyno- 
mials a(x) and b(a) with integer coefficients such that f(x) = a(x)b(x). (In other 
words, f(a) is reducible over Q if and only if f(x) is reducible of Z.) 


EXERCISE 6.23. ***(Interpolation) Suppose that a1, @2,...,@, € R are distinct and that 
by, bo,...,6, € R. Prove that there exists a polynomial f(x) with real coefficients such that 
f(a:) = b; and f has degree at most n — 1. (Hint: For i = 1,2,3,...,n define a polynomial 
f, such that f;(a;) =0 for all j Ai and f;(a;) = b;. Then define f = 5°, fi.) 


EXERCISE 6.24. ***** Prove the Eisenstein criterion. If f(a) = an2" + dn—yv" 1 +++++ 
a,x+4do is a polynomial with integer coefficients, and p is a prime number such that p divides 
1, @,...,Qn—1, p does not divide a, and p? does not divide ag. Then f(z) is irreducible over 
Q (it suffices to show, using the previous problem, that it is impossible to write f(a) as a 
product of two nonconstant polynomials with integer coefficients). 
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EXERCISE 6.25 (Putnam 1985, B2). *** Let k be the smallest positive integer for which 
there exist distinct integers m ,,™m2,™m3,m4,ms5 such that the polynomial 
p(x) = (% — m1)(x — m2)(x — ms) (x — ma)(x — ms) 
has exactly k nonzero coefficients. Find, with proof, a set of integers m,,™m2,™m3, m4, Ms for 
which this minimum k is achieved. 


EXERCISE 6.26 (Putnam 1985, A4). **** Define a sequence {a;} by a, = 3 and aj, = 3% 
for i > 1. Which itnegers between 00 and 99 inclusive occur as the last two digits in the 
decimal expansion of infinitely many a,;? 


EXERCISE 6.27. * What are the quotient and the remainder of division of 27 + 2° — 2+ + 
22° + 4x? — 1 by 2? +27 -—2+1? 
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CHAPTER 7 


Pigeonhole Principle 


The pigeonhole principle is the following observation: 


THEOREM 7.1. Suppose that > kn marbles are distributed over n jars, then one jar will 
contain at least > k +1 marbles. 


(It can also be formulated in terms of pigeons and pigeonholes, hence the name.) The 
proof of this pigeonhole principle is easy. It is more difficult to know when to apply it. There 
are many surprising applications of the pigeonhole principle. The pigeonhole principle was 
first explicitly formulated by the mathematician Dirichlet (1805-1859). 

The pigeonhole principle says for example that at least two people in New York City will 
have the same number of hairs on their head. This is because humans have < 1,000,000 
hairs and there are > 1,000,000 people in NYC. 

The pigeonhole principle is particularly powerful in existence proofs which are not con- 
structive. For example in the previous example we proved the existence of two people with 
the same number of hairs without specifically identifying these two individuals. 


EXAMPLE 7.2. How many bishops can one put on an 8 x 8 chessboard such that no two 
bishops can hit each other. 


It seems like a good idea to put 8 bishops in one row at the edge of the board. If we put 
6 bishops on these positions then we see that still no two bishops can hit each other. So we 
see that at least 14 bishops can be put on the chessboard. We conjecture that this number 
is maximal. 

How can we prove that 14 is the maximal number of bishops? We can use the pigeonhole 
principle. We need to partition the 8 x 8 = 64 fields into 14 sets such that whenever two 
bishops are on fields which lie in the same set, then they can hit each other. 

Note that a bishop on a black field only can move to other black fields. A bishop on a 
white field can only move to other white fields. To prove that 14 is the maximal number of 
bishops, we could prove that there at most 7 “black” bishops and at most 7 “white” bishops. 
We try to partition the 32 black fields into 7 sets such that if two bishops are on fields in 
the same set, then they can hit each other. We see that the following configuration works: 
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4 5 6 7 


A similar configuration works for the white fields. (Take the mirror image.) In the proof 
that we are going to write down, we do not need to distinguish between “black” and “white” 
bishops. We can combine the partition of black fields and the partition of the white fields 
to get a partition of the set of all 64 fields into 14 subsets. 


PROOF. The answer is 14. Place 14 bishops on the chessboard as follows: 


44444444 


444444 


Then no two bishops can hit each other. 
Let us label the fields on the chessboard as follows: 


— 
i) 
i) 
— 
— 
Ww 
— 
fo) 
Mm |\o | | 


NILO;M\) or; 


14) 4) 13) 5) 12} 6] 11] 7 
4|) 14) 5 | 13}6 | 12) 7) 11 


Whenever two bishops are placed on the chessboard with the same number, they can 
hit each other. If a number of bishops are placed on fields of the chessboard such that no 
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two can hit each other, then all the numbers of the fields are distinct. This shows that the 
number of bishops is at most 14. 


EXAMPLE 7.3. (Putnam 1958) Let S be a subset of {1,2,3,...,2n} with n+1 elements. 
Show that one can choose distinct elements a,b € S such that a divides b. 


Is the n + 1 in the problem sharp? Suppose that S' has the property that for every pair 
of distinct a,b € S, a does not divide b and b does not divide a. How many elements can S' 
have? 

If S contains 1 then it could not contain any other element. If S contains 2 then all other 
even numbers are excluded. In order to get S as large as possible, it seems that one should 
choose large elements. If we take the n largest elements, S = {n+1,n+4+ 2,...,2n} then 
clearly no two distinct elements divide each other. Any positive integer a with a < n divides 
an element of S. This shows that S is maximal with the desired property. 

The problem has somewhat of a pigeonhole flavor: We are asked to prove the existence 
of certain elements a,b € S but it seems unlikely that we can explicitly construct these 
elements. 

How can we apply the pigeon hole principle? Since we have n+ 1 elements we partition 
{1,2,...,2n} into n sets T,,7>,...,7;,. The pigeonhole principle says that SM T; contains 
at least two elements for some 7. This then should be useful to conclude that there exist 
a,b € S such that a divides b. So we would like 7; to have the following property: Whenever 
a,b € T; with a < b then a divides b. So T; should be of the form 


{a1, 2, 43,.-. ar} 
with a; | a2 | a3 |---| ax. The largest such set is 
PID DA098. a5 He 


Let us define 7; to be this set. The smallest element not in 7) is 3. So let us define 
P09 8032083 HO 
The smallest element, not in 7; and T> is 5. so let us define 
TS) 50 Gee he be 
Let us define more generally 
Ti, = {(2k — 1), (2k — 1) - 2, (2k — 1) - 2?, (2k — 1) -2°,...} 


for k = 1,2,...,n. It is easy to see that the union of 7), 7>,...,7;, contains. {1,2,...,2n}. 
We are now ready to write down the proof, using the pigeonhole principle. 


PROOF. Let us define 
Tr. = {1,2,...,2n}M {(2k — 1), (Qk —1)- 2, (2k — 1) -2?, (2k — 1) - 2°,...} 
for all k = 1,2,...,n. Every c € {1,2,...,2n} can uniquely be written as c = 2/(27 — 1) 
with 7 € N andi € {1,2,...,n}. This shows 7), 7>,...,T;, is a partition of {1,2,...,2n}. 
By the pigeon hole principle, S 9 7; contains at least two distinct elements for some 7, say 


{a,b} C SOT; with a < b. then a = 2/(2i — 1) and b = 2*(2i — 1) for some j,k € N with 
j <k and it is clear that a | b. 
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EXAMPLE 7.4. Suppose that S is a set of n integers. Show that one can choose a 
nonempty subset T’ of S such that the sum of all elements of T is divisible by n. 


One can try to attack this problem using the pigeonhole principle but it is not immediately 
clear how we can apply it. What are the “pigeonholes” here? Since our goal is to prove 
that something is divisible by n, it is natural to take the congruence classes modulo n as 
pigeonholes. The pigeonhole principle says: “Given n+1 integers, one can choose two of them 
such that their difference is divisible by n.” How can we apply this here? We can apply it if 
we have integers a1, @2,...,@n41 such that a; — a; is a sum of distinct elements of S for all 
i <j. We can indeed achieve this. Suppose that S = {b1, bo,..., bn}. Take a; = 0, ay = by, 
az = b, + be, ag = b; + bg + b3, etc. We can write down our proof: 


PROOF. Suppose that S = {b;,b2,...,b,}. Define 
cg = b+ be +--- +0; 
for all i = 0,1,...,n (co = 0). Of the n +1 numbers co, ci,...,Cn, at least 2 must lie 
in the same congruence class module n by the pigeonhole principle. Assume that we have 


c; = c;(modn) for i < j. Then we get that 


CG = bes Dye ee 8; 


is divisible by n. 


EXAMPLE 7.5. Suppose that we are given a sequence of nm + 1 distinct real numbers. 
Prove that there is an increasing subsequence of length n+ 1 or a decreasing subsequence of 
length m+ 1. 


To get an idea, let us do a random example with n = m = 3: 
(32) 55, 63, 57, 60, 74, 85, 16, 61, 7, 49. 


What is the longest increasing subsequence and what is the longest decreasing subsequence? 
How can we efficiently find these without checking all possible subsequences? 

For a longest decreasing sequence in (32) there are two cases. Either such a sequence 
ends with 49 or it does not. If the sequence does not contain 49, then a longest decreasing 
sequence of (32) is a longest decreasing subsequence of the shortened sequence 


(33) 50, 63, 57, 60, 74, 85, 16, 61, 7. 


If 49 appears in a longest decreasing subsequence, then we may wonder what the previous 
element in that subsequence is. Is it 55,63,57,60,74,85 or 61? (Clearly 16 and 7 are out of 
the question because the subsequence is decreasing.) It would now be useful to know for 
each x in {55, 63,57, 60,74, 85,61} what the longest decreasing subsequence is of (33) that 
ends with x. We then could take the longest decreasing subsequence of (33) ending with 
some x in {55, 63,57, 60,74, 85,61} and add 49 at the end to obtain a longest decreasing 
subsequence of (32). 
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So to find the longest decreasing subsequence of (32) we need to find the longest decreas- 
ing subsequence of (32) ending with «x for x = 55,63,...,49 (in that order). 


x a longest decreasing subsequence ending with x 


55 55 
63 63 

57 63,57 
60 63, 60 
74 74 

85 85 

16 63, 60, 16 
61 85,61 

G 63, 60, 16,7 
49 85,61, 49 


We have found that the longest decreasing subsequence has length 4. Namely the subse- 
quence 63,60,16,7 is decreasing. So the statement we want to prove works out in this example. 
Let us determine the longest increasing sequence: 


x a longest decreasing subsequence ending with x 


55 55 
63 55,63 

57 55,57 

60 55,57, 60 

74 55,57, 60, 74 
85 55,57, 60, 74, 85 
16 16 

61 55,57, 60, 61 
7 7 

49 7,49 


We see that there even is an increasing sequence of length 5. 

We checked one (small) example and it seems that we are still far from a solution (but 
this is actually not the case). 

We are interested in the lengths of maximal increasing/decreasing sequences. So let us 
make a table containing the length of a longest increasing sequence ending in x and the 
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length of a longest decreasing sequence ending in «x for all zx. 


x decreasing increasing 
55 1 1 
63 
57 
60 
74 
85 
16 
61 
t 
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Let us plot the last two columns against each other. We get 10 distinct points 


GF 9 We 2 ees mallee 9 Te leat ya cs ve ee ea 


5 @ 
4 e e 
3 e 


2 @ e e 


1 @ @ e 


1 2 3 4 5 
If there were no increasing or decreasing sequence of length 4, then all points would fit in 
a 3 x 3 box and two of the points would have to coincide by the pigeonhole principle. This 
leads to a contradiction as the following proof shows. 


PROOF. Suppose that 
U1, LQ,++- »>Umn+1 


is a sequence of distinct real numbers. Let a; be the length of the longest decreasing sub- 
sequence ending with x;. Let b; be the length of the longest increasing subsequence ending 
with x;. We claim that if i # 7, then that (a;,b;) A (a;,0;). 

Indeed, if x; < x; then we can take a longest decreasing subsequence ending with x; and 
add x; at the end. This way we obtain a decreasing subsequence ending with 2; of length 
a; + 1. This shows that a; > a; and (a;,b;) 4 (aj, bj). 

If x; > x; then we can take a longest increasing subsequence ending with 2; and add x; 
at the end. This shows that b; > b; and (a;,b;) 4 (a;, b;). 

There are only nm pairs (a,b) with a,b € Zand 1<a<mand1<b<n. We must 
have a; > m or b; > n for some 2. 


1. Diophantine Approximation 


EXAMPLE 7.6. Suppose that a is a real number and that N is a positive integer. Show 
that one of the numbers a, 2a, 3a,..., Na differs at most x from an integer. 
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It is not so clear a priori how one can use the pigeonhole priciple in this problem. Since 
we are only interested in the value ia modulo the integers, we define 6; = ia — [ia]. (|x| is 
the largest integer < x.) We observe that 0 < 8; < 1 for all 7. We want to show that 6; < x 
or B; >1- + for some 7. 

An important observation is that {;,; — 6; is the same as a; up to an integer for all 7. 
So if 6:4; is very close to §; then a; is very close to an integer. We want to show that one 
can choose two of the N + 1 numbers (69 = 0, (1, So,..., Bn that are at most of distance + 
of each other. 

Here is where the pigeonhole principle might come in. We need to partition the interval 
(0,1) into N sets, such that every two elements in the same set are at most + apart. This 
is possible. We can take S;, = [4+,£) for k = 1,2,...,N. By the pigeon hole principle at 
least two of the numbers in {, 61,..., Gy lie in the same set S;. Let us write down a more 
formal proof. 


PROOF. Define 


6; = ia — lia] 
for 1 = 0,1,2,...,N. Then 0 < 8; < 1 for all 7. Define S,; = [S+, 4) VG) Na ae mn a Vee Bt 
The interval {0,1) is the union of intervals S),52,...,Sj. By the pigeon hole principle, at 
least two of the numbers {, ;,..., By lie in the same interval S;, for some k. Say {;, 8; € Sz 
for some 2,7 with 0<i<j< N. But then 
1 ey ; 1 
Hy 285 — B= Ha — (Lal — |ia]) < 5. 


We are done because 1 < j —i < N and |ja| — |éa] is an integer. 


diophantine approximation is an area of number theory where one likes to approximate 
irrational numbers by rational number. The previous example allows us to characterize 
irrational numbers! 


THEOREM 7.7. A real number a is irrational if and only if there exists a sequences of 
integers pi, p2,... and q1,qo,... such that 


lim qn — Pn = 0 
noo 
and qnQ@ — Pn #0 for alin. 


PROOF. Suppose that a is irrational. For every n, we can find integers py, gn with 
1 <q <n such that |g,a — pr| < 4. It follows that 


lim qn — pn = 0. 
n> co 
Obviously gna — Pn # 0 for all n because a is irrational. 


Conversely, suppose that gna@ — pn # 0 for all n and limyn-+o6 dn@ — Pn = 0. Assume that 
a is rational, say a = § with a,b € Z and b > 0. Now we have 


Gn@ — Pnd a 1 
b — Ob 


because g,Q@ — py, # 0. This leads to a contradiction with lim,_..5 d,@ — pn = 0. 


lana =p; = 


We can apply this: 
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THEOREM 7.8. The number V2 is irrational. 
Proor. Expanding (2 — 1)" and using (2)? = 2 we get 
(V2 —1)" = gnV2— Dn 
for some integers p, and g,. We have 
lim dnV2 — pn = lim (V2 — 1)" =0 
n—00 n—00 
because |/2 — 1| < 1. Also 
dnV2 — Pn = (V2 — 1)" #0 
for all n because 2 # 1. This proves that /2 is irrational. 


2. balls 


For a finite set S we denote the number of elements of S by |S|. The pigeonhole principle 
can be reformulated as: 


THEOREM 7.9. If 5 1,S2,...,S, are subsets of a finite set T and 
Si| + |So|+---+|S,| > IT] 
then there exists an element x € T that lies in at least k +1 of the sets $1, S2,..., Sn. 


If S is a measurable set in R®, let ju(.S') be its volume. We have the following variation 
of the previous theorem. 


THEOREM 7.10. If 51, .S2,...,Sn are measurable subsets of a measurable set T, and 


w(S1) + w(S2) +++ W(Sh) > ku(T). 
then there exists an element x € T that lies in at least k + 1 of the sets S1,So,..., Sp. 


EXAMPLE 7.11. Let S be a set of points in the cube [0,1] x [0,1] x [0,1] (in R*). Such 
that the distance between every two distinct elements x,y € S is at least 0.1. Give an upper 
bound for the number of elements of S. 


We could use the pigeonhole principle as follows. Partition the cube into small regions, 
such that in each region the maximal distance between two points in this region is < 0.1. 
For example, we could partition the cube into N x N x N little cubes of sidelength x 
The maximum distance between two points in this little cube is /3/N, the length of its 
diagonal. We need that /3/N < 0.1, so N > 10/3 © 17.32. (Without a calculator we see 
that 10/3 < 18 because 18? = 324 > 300 = (10/3)?.) So let us take N = 18. Each of the 
N x N x N cubes can contain at most 1 element of S. Therefore, the cardinality of S is at 
most 18° = 324-18 = 5832. 

There is another way of looking at this problem. Instead of saying that two points x and y 
have distance at least 0.1, we could say that the balls with radius 0.05 around x and around y 
are disjoint. Note that all balls lie within the cube [—0.05, 1.05] x [—0.05, 1.05] x [—0.05, 1.05] 
with volume 1.1? = 1.331. We can reformulate the problem as the problem of packing oranges 
of diameter 0.1 into a (cube-shaped) box of sidelength 1.1. The volume of a ball with radius 
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0.05 is $7(0.05)? = 0.00523599 (we are using a calculator now). The number of oranges is 
at most 
(Lay _ so 5 2) 7086 
$7(0.05)3 Ar 

This means that S' has at most 2542 points. This is quite an improvement. 

Johannes Kepler (1571-1630) conjectured in 1611 that the densest way of packing balls in 
R? is the cubic or hexagonal packing (well-known to people selling oranges). These packing 
give a density of 


& 2542.02. 


1 

—— & 74.048%. 

3V2 : 

Kepler’s conjecture was proven by Thomas Hales (U of M!) in 1998. 
Using this result, we see that S can have at most 


ot es 198 14/9 1889-8189 
T 


So S' can have at most 1882 elements. (We do not claim here that this number is sharp.) 


EXAMPLE 7.12. A binary word of length n is a sequence of 0’s and 1’s of length n. The 
set {0,1}" is the set of all binary words of length n. Let S be a subset of {0,1}” with the 
following property: for every pair of distinct elements x = %1%2---%, and y = y1Yo°++ Yn we 
have that x and y differ in at least 3 positions. Show that S' has at most 

Qn 
ned 


elements. 


PROOF. We can use the ideas in the previous example. The distance d(x, y) between 
two words © = %1%2°++X%p, and y = y1Yo2°++ Yn is the number of positions where they differ. 
For any binary word x € {0,1}”, let B(x) be the ball with radius 1, so 


Bley =Aee LOL" | ae, 2). <1. 


In other words B(x) is the set of all binary words of length n which differ from x in at most 
1 position. The number of elements of B(x) is n+ 1 (namely « itself and all words obtained 
by changing x at on position). Notice now that d(x, y) > 3 is equivalent with B(x) and B(y) 
are disjoint. So all balls 

Biz), xreEs. 


are disjoint. The disjoint union of all balls 

LU B(x) 

res 
has exactly |S|(n +1) elements. On the other hand, this union is a subset of {0,1}” which 
has 2” elements. We obtain the inequality 


\S|(n +1) <2” 


The previous example is known as the Hamming bound for 1-error correcing binary codes. 
The mathematician Richard Hamming (1915-1998) also studied examples where equality 
holds (and these are known as Hamming codes). 
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3. Exercises 


EXERCISE 7.1. * What is the maximum number of rooks that one place on an 8 x 8 
chessboard such that no two rooks can hit each other? Prove your answer. 


EXERCISE 7.2. * 
What is the maximum number of queens that one can place on an 8 x 8 chessboard such 
that no two Queens can hit each other? 


EXERCISE 7.3. **** What is the maximum number of queens that one can place on an 
8 x 8 chessboard such that no two Queens can hit each other? 


EXERCISE 7.4. ****/Dutch Mathematics Olympiad] A set S of positive integers is called 
square-free if for all distinct a,b € S' we have that the product ab is not a square. What is 
the maximum cardinality of a square free subset S C {1,2,3,...,25}? 


EXERCISE 7.5. * Show that (a — b)(a — c)(b —c) is always even if a, b,c are integers. 


EXERCISE 7.6. **** Prove that for every integer n > 2 there exists an integer m such 
that k? — k +m is not divisible by n for all integers k. 


EXERCISE 7.7. *** Show that, given a 7-digit number, you can cross out some digits at 
the beginning and at the end such that the remaining number is divisible by 7. For example, 
if we take the number 1234589, then we can cross out 1 at the beginning and 89 at the end 
to get the number 2345 = 7 x 335. 


EXERCISE 7.8. **** Improve the statement in Example 7.6: Suppose that a is a real 
number and that N is a positive integer. Show that one of the numbers a, 2a, 3a,--- ,Na 


differs at most wot from an integer. 


EXERCISE 7.9. *** Prove that the Euler number 
1 1 1 1 
| 


is irrational. 


EXERCISE 7.10. **** 
Let 21, %2,...,%n € R with |z,;| < 1 fori =1,2,...,n. Show that there exist a), a2,...,@n € 
{—1,0,1}, not all equal to 0, such that 


| | | | | < i 
A,X + AgX2 4 Fr Antn| S on] 

EXERCISE 7.11. ***** [IMO 1987] Let 21, r2,...,2%n be real numbers satisfying x? +23 + 
--- +”? = 1. Prove that for every integer k > 2 there are integers a1, @2,...,@n, not all 0, 


such that |a;| << k —1 for all 7 and 
(k —1)J/n 
kn —] 
EXERCISE 7.12. ***** [Dutch Mathematical Olympiad] Suppose that S is a subset of 


{1,2,3,...,30} with at least 11 elements. Show that one can choose a nonempty subset T 
of S such that the product of all elements of T is a square. 


|ay%1 + Go@g +--+ + antp| < 
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EXERCISE 7.13. *** Suppose that ao, a1,d2,...,@, € Z are integers. Show that the 


product 
I] (a; — ai) 
0<i<j<n 
is divisible by nl. 
EXERCISE 7.14. **** Let ai, a2,...,@19 be distinct integers from {1,2,...,99}. Show 
that {a1,a2,...,@19} contains two disjoint non-empty subsets with the sum of the numbers 


from the first equal to the sum of the elements from the second subset. 


EXERCISE 7.15. ***** The set M consists of 2001 distinct positive integers, none of 
which is divisible by any prime p > 23. Prove that there are distinct x,y, z,t in M such that 
xyzt = u* for some integer uw. 


EXERCISE 7.16 (Putnam 1989). **** Let m be a positive integer and let G be a regular 
(2m+1)-gon inscribed in the unit circle. Show that there is a postive constant A, independent 
of m, with the following property. For any point p inside G there are two distinct vertices v1 


and v2 of G such that 
1 A 
[pg = pao | ae: 
at 
Here |s — t| denotes the distance between the points s and t. 


EXERCISE 7.17. *** Show that 4 


2p 
P 
where p runs over all prime numbers, is an irrational number. 


EXERCISE 7.18 (Putnam 2000). Let a,,b;,c; be integers 1 < 7 < N. Assume for each J, 
at least one of a;,b;,c; is odd. Show that there exist integers r,s,¢ such that ra; + sb; + tc; 
is odd for at least 4N/7 values of 7, 1<j <N. 
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CHAPTER 8 


Sequences and Series 


DEFINITION 8.1. A sequence of real numbers 71, %2,73,... converges to a real number x 
if for every 6 > 0 there exists a positive integer N such that for all n > N we have 


|2n —2| <0. 
We will write lim,,_,.. %, = x and say that x is the limit of x71, 72, 73,... as n tends to infinity. 


DEFINITION 8.2. We say 


lim 2, = oo 
noo 


if for every positive real number C’, there exists a positive integer N such that x, > C for 
all n > N. Similarly, one can define what is meant by 


lim xv, = —00. 
n—- oo 
EXAMPLE 8.3. 
lim — = 0. 
noo 1 


PROOF. Given 6 > 0, we can choose N > 1/6. Then for all n > N we have 


1 1 
— =O ee CO. 
n | ~ N 
EXAMPLE 8.4. If |x| < 1, then 
hin 0 
N+ Oo 
Proor. Put ¢ = 1-—|z|. Then 
1 1 


™—(l-—e)"< < : 
Para ) ~ (l+e)"~ l+en 


If we now take N > (de)~!, then for all n > N we have 


1 
[ger sO, 
en 


This shows that lim,-_,,, 7” = 0. 
There are various well-known rules for limits, such as: 


LEMMA 8.5. Jf limy0%n = £© and limp +o Yn = y then limp+o(%n + Yn) = e+ y and 
iim Cathe — Ee. 
There are also some more abstract theorems for showing convergence, such as: 
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THEOREM 8.6. If C is a real number, and 


US 4g S43 << 4%4<... 


is a weakly increasing sequence with x; <C' for alli, then x1, %2,%3,... converges. 


THEOREM 8.7. If 21,22, %3,... 18 a sequence such that for every 6 > 0, there exists an 
n such that |r, —2m| <6 for allm >n. Then x1, 22,%3,... converges. Such a sequence 
1, X9,%3,... 18 called a Cauchy sequence. 


1. Series 
We will discuss convergence of series. 


DEFINITION 8.8. If x1, %2,2%3,... is a sequence of real numbers, then we say that the 


series 
CO 
Ez 
i=l 


converges with value y if the sequence yj, yo, y3,... with y, defined by 


n 
i=1 


converges to y. 


EXAMPLE 8.9. 


PROOF. Note that 


Sle 
ih 
+ 

~~] rR 
IV 
me 
+ 

cS 
| 

i 

ry 
| 


n=1 j=l 1=2941 J=1 1=2341 
k-1 
23 dle 
j=l 
Given a positive integer C, then for m > 2° we have 
cy ee 1 
ae —>2C-=C. 


An alternative proof uses the fact that 


Therefore 
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EXAMPLE 8.10. If |x| < 1 then 


CO - 1 
2 ae 


PROOF. We have 


=1+(-2r+2)4 (=z | x”) Paired al (—2” +2") —2™2 21-9", 
(telescope sum) and therefore 
ee 
a 
n=0 l-a& 


If we take the limit m — oo we get 


= ts 1-2 

y a” = lim y a” = lim = : 
m—00 m>oo 1—2Z 1-—7z 

n=0 n=0 


since lim, 2” = 0 as we have seen before. 


2. Exercises 
EXERCISE 8.1. ** Let x be a real number with |z| < 1. Find and prove a formula for 
1+ 27 +3x7 +42? +--- 


EXERCISE 8.2. ** Suppose that a1, a2,a3,... are real numbers such that 
aj +ag+a3+-::: 
converges. 
(a) Does a? + a3 + a3 +--+ necessarily converge? 
(b) Assume also that a1, a2, a3,... are all nonnegative. Does a?+a3+a3+--- necessarily 
converge? 


EXERCISE 8.3. ** 
(a) Show that (using comparison with an integral for example), that for a > 1, 


(oe) 


1 
n=1 ae 
converges. 
(b) Show that 
2 1 
— nlog(n) 
diverges. 
(c) Show that for a > 1, 
s 1 
“ nlog*(n) 


converges. 
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(d) Show that 


(oe) 


1 
Do isa inne) 


n=2 


diverges (watch out for n = 2). 


EXERCISE 8.4. * Evaluate 
1 1 1 


1-2 2-3 3-4 
1 1 


(Hint: EER = | — 547 telescope sum!) 


EXERCISE 8.5. ** Show 
1 
don 


converges where the sum is taken over all positive integers n which don’t have any 9 if they 
are written in the decimal system. 


EXERCISE 8.6 (Putnam). ***** Let a; < a2 < a3 < --: be an increasing sequence of 
positive integers. Let the series 
= 1 
Bi 


m=1 ” 
be convergent. For any number 2, let k(x) be the number of the a,,’s which do not exceed 
x. Show that lim, ,.. k(x)/x = 0. 


EXERCISE 8.7. *** Show that 

1 
yao 
n=1 Pn 


where p,, is the n-th prime number. (Hint: Use the fact that every natural number has a 
unique prime factorization, and show that 


1 m ae i as —m 
= sen, eh aa pe eras) oor prep ear De) 
n=1 
ss 1 1 1 ) 
bap, Lop “Lape 
EXERCISE 8.8. ***** Construct real numbers a1, d2,a3,... such that )>~ , a, converges, 
and 5°”, a3 diverges. 
EXERCISE 8.9. *** Let aj, a2,... be real numbers such that )> a,/n converges. Show 


that limy_+o + pan GSU. 


EXERCISE 8.10. *** Evaluate 


EXERCISE 8.11 (Putnam 1985). Let d be a real number. For each integer m > 0, define 
a sequence {a,,(j)}, 7 =0,1,2,... by the condition 


ay, (0) =d/2™; and sa,4(7-+ 1) = tan)? +2an(),. 7 20: 
Evaluate limy_5o. Gn (7). 
EXERCISE 8.12 (Putnam 1990). Let 
Ty = 2,7, = 3,7T> = 6, 


and for n > 3, 
Ls = (n + 4)T 1 = 4nT,_-9 + (4n = 8)Tn-3- 
The first few terms are, 
2, 3, 6, 14, 40, 152, 784, 5168, 40576. 
Find, with proof, a formula for T;,, of the form T,, = A, + B,, where {A,} and {B,,} are 
well-known sequences. 


EXERCISE 8.13 (Putnam 1993). Let {an }n>o be a sequence of nonzero real numbers such 
that 2? —@p_12%n41 = 1 for n = 1,2,3,.... Prove that there exists a real number a such that 
tei Hat, = Xi for all nl, 


EXERCISE 8.14. Suppose that A;, Az, A3,... is a sequence of positive integers such that 
A, = land A; < Aj,, < 2A; for all positive integers 7. Prove that every positive integer n 
can be written as a sum of distinct A;’s. 


EXERCISE 8.15. ****[Putnam 1985, B3] Let 
G11 412 41,3 


421 422 42,3 
43,1 43,2 43,3 


be a doubly infinite array of positive integers, and suppose each positive integer appears 
exactly eight times in the array. Prove that a,,, > mn for some pair of positive integers 
(m,n). 


EXERCISE 8.16. Suppose that we have n lines in the plane such that (i) no two lines 
are parallel and, (ii) no three lines go through 1 point. In how many regions do these lines 
divide the plane? Prove your formula. 
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CHAPTER 9 


Generating Functions 


EXAMPLE 9.1. A triangulation of a convex n-gon is a partition of the area of the n-gon 


into triangles such that the vertices of each triangle is a vertex of the n-gon. How many 
distinct triangulations does a convex 10-gon have? 


Let A, be the number of triangulations of an n-gon. Let us find the value of A,, for small 
n. 
We have A3 = 1 and Ay = 2: 


7 \ : S / 


It becomes more and more clear that it m 


ay not be feasible to write down all trian- 
gulations of an 10-gon. As n gets larger, we need a more systematic way of counting the 


possibilities to make sure that we are not forgetting any case. 


Let P and Q be two fixed adjacent vertices of the n-gon. For each triangulation, there is 
a unique vertex R of the n-gon (R 4 P,Q) such that PQR is a triangle in the triangulation. 


For example, for n = 8 there are the following cases: 
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OOW 
es PQ PQ 
Cie 


For fixed R, the complement of the triangle PQR within the n-gon is a union of an m-gon 
and a (n+ 1-—m)-gon. The m-gon has A,, triangulations, and the (n + 1 — m)-gon has 
Anti_m triangulations. This gives AmAnii—m triangulations for this particular choice of R. 
From this we see the equation: 


An = AgAn_1 + AgAn_2 + +++ + An_1 Ao. 
where we define Ay = 1. In particular, 
A, = 1-1441-54+2-245-1+4+14-1=42 
Ag = 1-42+1-144+2-54+5-2+14-1+42-1=132 
Ag = 1-13824+1-4242-144+5-5414-2+4+42-14 1382-1 = 429 
Aig = 1-4294+1-1824+2-424+5-144+14-54 42-24 132-1+ 429-1 = 1430. 
By the way, if one defines C,, = An+2 for all n > 2, then C,, are the so-called Catalan 


numbers. The Catalan numbers have many interesting interpretations (which we will not 
discuss now). It is known that 
2n 
—\n 


ee 


n 


So for example Ajo = Cg = (S) /9'= 1430. 
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CHAPTER 10 


Probability 


1. Discrete Probability 


Discrete probability deals with discrete random variables. For an event A we denote the 
probability that this event occurs by P(A). If A and B are two events, then the probability 
that both A and B occur will be denoted by P(A, B) or by P(A and B). The probability 
that A or B occurs will be denoted by P(A or B), etc. 

The probability that event A occurs, given that B occurs is denoted by P(A|B). This is 
called a conditional probability. The formula for P(A|B) is 

P(A, B) 
P(A|B) = P(B) 
Suppose that we throw two dice. Let A be the event that the first die is 5. Let B be the 
event that the sum of the two dice is > 10. Now we have P(A) = 4. We have P(B) = # 
because there are 5 possibilities to get a sum > 10, namely (4,6), (5,6), (6,6), (5, 5),(6, 4). 
Similarly the probability P(A, B) is equal to 2 because there are two possibilities for the 
first die to be 5 and the sum to be > 10, namely (5,5) and (5,6). 

The probability that the first die is 5, given the fact that the sum of the dice is > 10 is 

equal to 


alr 


P(A,B) 2 2 

P(A|B) = ee 

ae) P(B) a D 

Suppose that X is a random event that has finitely many outcomes, namely {21, %2,..., Un}. 
Each event X = x; has a certain probability, denoted by pj = P(X = 2;). We have0 < p; <1 


for all 2 and 


pi pe b+ pa L 
If X is real-valued, then the expected value of X is 


EBX = pit + pote + +++ + PnXn. 


EXAMPLE 10.1 (The Monty Hall Problem). This problem was inspired from the gameshow 
“Let’s Make A deal”, hosted by Monty Hall. It is now mathematical folklore. 

A TV host shows you three numbered doors (all three equally likely), one hiding a car 
and the other two hiding goats. You get to pick a door, winning whatever is behind it. 
Regardless of the door you choose, the host, who knows where the car is, then opens one of 
the other two doors to reveal a goat, and invites you to switch your choice if you so wish. 
Does switching increases your chances of winning the car? 


The answer is YES! If the host always opens one of the two other doors, you should 
switch. Notice that 1/3 of the time you choose the right door (i.e. the one with the car) 
and switching is wrong, while 2/3 of the time you choose the wrong door and switching gets 
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you the car. Thus the expected return of switching is 2/3 which improves over your original 
expected gain of 1/3. 


Some experiments have an infinite number of outcomes as in the following example. 


EXAMPLE 10.2. Suppose that we throw a coin. If the outcome is heads, then we throw 
again. We repeat this until the outcome is tails. Let X be the number of throws needed. 
Let p be the probability that the a coin gives tails. What is the expected number of throws 
needed? 


The range of X is {1,2,3,...}. The probability P(X = i) is equal to (1 — p)*1p. If 
|z| < 1 then 
1 —]4 Lm? ti... 
1 - = rere T 2 
If we apply this here we see that indeed 


(oe) 


i-1 1 
2 — p) pa = =p) =: 
The expected value of X is 
S-i(1 — p) tp. 
i=1 
If |~| < 1 then we have the formula 
re ee 
(1 — x)? 
From this follows that the expected number of throws EX is equal to 
1 1 
"a=(=p)P P 


2. Inclusion-Exclusion 


For the following example we need to introduce the inclusion-exclusion principle (al- 
though it is not really part of combinatorics rather than probability theory). If A,, Ag are 
finite sets, then 


|A,; U Ag] = |Ai| + |A2| — |Ai U Ao] 
(If X is a finite set then |X| denotes its cardinality). If A,, Ay, A3 are finite sets, then 
|A; U Ag U As| = |Ai] + | Ae] + [As] — |Ai M Ag] — |AiM As] — |A2M As] + [Ai Ae As]. 
These formulas generalize to an arbitrary number of sets. In general one has 
(34) |Ai U Ag U+++U An| = 1 — C2 + 63 — Ca Fe + (-1)"" en. 
where 


Cp = S- AOA nea: 


1<04 <t2<0 <p <n 
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EXAMPLE 10.3. Suppose each person of a group of n people write their name on a piece 
of paper. The papers with the names are put in a random. Each person pulls (without 
looking) a name out of the hat at random. (This is common practice when people want to 
prepare presents for each other on holidays.) Let p, be the probability that someone drew 
his own name. What is lim,_,.5 Dn? 


Let A; be the set of all permutations o of {1,2,3,...,n} for which o(7) =i. (A permu- 
tation o of {1,2,...,n} is a bijective (onto and 1-1) map from {1,2,...,n} to itself. There 
are n! possible permutations.) Now A, U Ag U---U A, is the set of all permutations o for 
which o(i) =i for some i. The probability that someone draws his/her own name is: 


Aru Apu 3 GAR 
a n! ; 


n 


To find |A; U A, U---UA,,| we can use the inclusion-exclusion principle. If 1 <7, < ig < 
s+ <a, <n then 


Aj, NA N-++ 7 Aj, 
is exactly the set of all permutations o which fix 71, 722,...,2,. Such a o can freely permute 
ee reeee 1 ee Cheereee et - 
It follows that 
|Ai, NM Ags N---NA;,| = (n—r)! 


Since there are (") possible choices of 71, 722,...,2, we get that 


C, = (") (n—r)! = 7 


From the inclusion-exclusion principle follows that 


ml ath. sl a ae 
Ae UA ap tgp eo 
and ; 
il 1 oul 


We now have that lim,,., Pn = 1 — 1/e where e is the Euler number, because 


3. continuous probability 


A continuous random variable X has some values inside some infinite set S. On S we 
have some measure (integral). For example X may be a random point on in the square 
[0,1] x [0,1]. If U C [0,1] x [0,1], then the probability P(X € U) is equal the area of U. 
This is an example of a uniform distribution. 


EXAMPLE 10.4. On a piece of paper there are parallel lines drawn every 1 inch. We 
throw a needle on the piece of paper. What is the probability that the needle does not touch 
one of the lines? 
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PROOF. Mark one end of the needle with the letter P and the other end with the letter 
Q. Suppose that that lines are ...,y = —l,y = 0,y = 1,.... Let Y be the random 
variable with values in [0, 1) defined as the y-coordinate P modulo 1. Let X be the random 
variable of the angle of the vector PQ with the positive x-axis. This variable X has values 
in [0,27). The outcome of the pair (X,Y) is uniformly distributed over [0,27) x [0,1). If 
U C (0, 27) x [0, 1) then the probability that (X,Y) € U is exactly the area of U divided by 
2n. Let U be the set of all points (x, y) € [0,27) x [0,1) for which the line-segment between 
(0, y) and (cos(x),y + sin(a)) does not touch the line y = 0 or y = 1. This means that 
0 <y+sin(x) < 1. The area of the set 


U={(z,y)|0<2<27,0<y<1,0<y+sin(z) <1} 


If (x,y) € U then for a fixed angle x, y must lie in an interval of length 1 — |sin(z)|. This 
means that the area of U is 


20 T 
/ 1 —|sin(x)|dx = 2 | 1 — sin(x)dx = 24 + 2cos(x)|j = 2a — 4. 
z=0 z=0 
The probability that the needle does not touch one of the lines is 
a 
m—A ig 2 
21 7 


4. Exercises 


EXERCISE 10.1. * Assume that every born baby has exactly 50% to be a boy and 50% 
change to be a girl. Also assume that the life expectancy for boys and girls is the same. In 
some country, a new law is introduced which says that every couple may have at most one 
boy. This means that every couple who gave birth to a boy is not allowed to have any more 
children. What will happen to the population? Will there be more girls eventually? Or will 
there be more boys eventually? Or will ratio of boys and girls be close to 1? 


EXERCISE 10.2. * There are three pancakes in a hat. One pancake has two yellow sides. 
One pancake has two brown sides. The third pancake has a brown and a yellow side. A 
random pancake is pulled from the hat and it is put on a plate such that only one side is 
visible to you. The side that you can see is brown. What is the probability that the other 
side is brown? 


EXERCISE 10.3 (UMUMC 1997). ** 2000 students participated in a math competition. 
They had been arbitrarily assigned code numbers from 1 to 2000. All 2000 scores were 
different. Given that student 1 scored higher than students 2 through 1997, what is the 
probability that student 1 had the highest score overall? 


EXERCISE 10.4 (UMUMC 1997). *** 


(a) A and B plan to play a game where they take turns tossing a coin until someone 
flips head and thereby wins. On the basis of alphabetical order, A claims the right 
to go first, but B objects that this gives A an unfair advantage. To compensate, A, 
offers to allow B to use a biased coin, whereas A will use a fair coin. Prove that the 
game is still biased in A’s favor. 
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(b) In view of your proof of part (a), B demands further adjustments in the probabilities 
of the individual coin flips. Because it is difficult to adjust the bias of a coin, the 
players agree to play the following variant of the game. A large (but finite) number 
of ping-pong balls are placed in a bag; some ar labeled A and some B. The players 
alternately draw one ball out of the bag (each ball being equally likely and the draws 
being independent). The first player to draw a ball labeled by his own name is the 
winner. Any balls drawn that doesn’t win this way is put back into the bag before 
the next draw. A will draw first, but to compensate for this advantage, more balls 
will be labeled B than A. Is it possible to choose the number of balls with each 
label in such a way that the game is fair. If so, what should the numbers of balls 
be? 


EXERCISE 10.5 (UMUMC 2003). **** An unbalanced penny and an unbalanced quarter, 
with probabilities of head p for the penny and q for the quarter, are tossed together over 
and over. The probability that the penny shows heads (strictly) before the quarter is 3/5, 
and the number of tosses required for both coins to show heads simultaneously has expected 
value exactly 4. Find the values of p and q. 


EXERCISE 10.6 (Putnam 1989). **** If q@ is an irrational number, 0 < a < 1, is there a 
finite game with an honest coin such that the probability of one player winning the game is 
a? (An honest coin is one for which te probability of heads and the probabily of tails are 
both 5. A game is finite if with probability 1 it must end in a finite number of moves.) 


EXERCISE 10.7 (Putnam 2002). *** Shanillo O’ Keal shoots free throws on a basketball 
court. She hits the first and misses the second, and thereafter the probability that she hits 
the next shot is equal to the proportion of shots she has hit so far. What is the probability 
she hits exactly 50 of her first 100 shots? 


EXERCISE 10.8. **** Prove the inclusion-exclusion principle. 


EXERCISE 10.9. ** In a classroom there are 22 students. Show that the probability that 
two of the students have the same birthday is more than 50% (for simplicity you may assume 
that no students were born on February 29, you may need a computer or a calculator). 


EXERCISE 10.10. ***** [Putnam 1992] Four points are chosen at random on the surface 
of a sphere. What is the probability that the center of the spere lies inside the tetrahedron 
whose vertices are at the four points? (It is understood that each point is independently 
chosen relative to a uniform distribution on the sphere.) 


EXERCISE 10.11. *** On a piece of paper we draw a grid. We have horizontal lines, and 
vertical lines. The horizontal lines are 1 inch apart and so are the vertical lines. A needle of 
one inch long is thrown at random on the piece of paper. What is the probability that the 
needle does not tuch any of the lines. 


EXERCISE 10.12. **** [Putnam 1993] Two real numbers x and y are chosen at random 
in the interval (0,1) with respect to the uniform distribution. What is the probability that 
the closest integer to x/y is even? Express the answer in the form r+ sz, where r and s are 
rational numbers. 


EXERCISE 10.13 (UMUMC 1996). ***** One person in a ring of n people has a keg of 
beer. He takes a sip and passes the keg to the left or to the right with 50% probability. The 
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recipient of the keg takes a sip and also passes the keg to the left or to the right with 50% 
probability. The process is repeated until everyone has had at least one sip. What is the 
probability distribution of the final position of the keg? 


EXERCISE 10.14 (UMUMC 2002). ***** Consider the sequence of first digits in the 
successive powers of 2: 
DA Oy le oeOs dein, 
Does one fo the digits 7 and 8 appear more often in the sequence than the other one? (We 
say for example that 5 appears more often than 6 in the sequence if there exists a positive 
integer N such that for all n > N, 5 appears more often than 6 among the first n terms of 
the sequence. ) 


EXERCISE 10.15. **** Let C be the unit circle x? +y? = 1. A point p is chosen randomly 
on the circumference C’ and another point q is chosen randomly from the interior of C’ (these 
points are chosen independently and uniformly over their domains). Let R be the rectangle 
with sides parallel to the x and y-axes with diagonal pg. What is the probability that no 
point of R lies outside of C? 
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CHAPTER 11 


Games 


1. Weighing Problems 


There are many popular problems involving weighing with either a balance scale or a 
numerical scale. A balance scale has two sides. With one weighing there are three possible 
outcomes, the left side is heavier than the right side, the left side is lighter than the right 
side or the left and the right side have the same weight. A weighing scale which gives you 
the weight of an object in pounds (or another unit) we will call a numerical scale (to make 
a clear distinction). 


EXAMPLE 11.1. Suppose that we have 25 coins that look identical. The coins are all the 
same except that one coin is counterfeit and heavier than the others. How can one determine, 
in three weighings on a balance scale, which coin is counterfeit. 


Divide the 25 coins up in three groups, say A, B and C such that A and B have 8 coins 
each and C has 7 coins. Put the A coins on the left side of the scale and the B coins on the 
right side of the scale. If A is heavier then A contains the counterfeit coin. If B is heavier 
then B contains the counterfeit coin. If both sides balance, then C has the counterfeit coin. 

So we found a set of 7 or 8 coins of which we know that one is counterfeit. We may take 
(for convenience) 1 or 2 coins from the remaining genuine coins so that we have 9 coins. We 
have to find out, with two remaining weighings which is the counterfeit coin among these 9 
coins. Split the 9 coins up in three groups, D,E and F, such that each group has exactly 3 
coins. Put the D coins on the left and E coins on the right of the scale. Again we find out 
which of the groups D,E,F cointains the counterfeit coin. 

We now have three coins left. Put one coin on the left and one coin on the right of the 
scale. The coin that is the heaviest is the counterfeit. If both coins weigh the same, then 
the third coin is the counterfeit one. 


EXAMPLE 11.2. Suppose that we have 30 coins that look identical. The coins are all the 
same except that one coin is counterfeit and heavier than the others. Is it always possible 
to determine, in three weighings on a balance scale, which coin is counterfeit. 


The answer is no. Each weighing has three possible outcomes. One could graph the 
possible events as a tree: 


From the 3 weighing events there can only be at most 3 x 3 x 3 = 27 possible outcomes. 
By the pigeonhole principle there are two numbers 7 and 7 with 1 <i < y < 30 such that 
the results of the weighings will be exactly the same if either the 7-th coin or j-th coin is the 
cointerfeit coin. 


EXAMPLE 11.3. Suppose you have 10 barrel of coins. Each barrel contains all real coins 
or it contains all fake coins. The real coins weigh 10 grams, and the counterfeit coins weigh 11 
grams. There is exactly one barrel with counterfeit coins. Determine, with only one weighing 
on a numerical scale, which barrel cointains the cointerfeit coins. (One may assume that the 
barrels cointain “enough” coins.) 


PROOF. Take one coin from the first barrel, two coins from the second barrel, three coins 
from the third barrel, etc. Put these 1+ 2+ ---+ 10 = 55 coins on the scale. If barrel k 
contains the counterfeit coins, then the weight will be 


1-10+2-104+---+10-10+ (11—10)-k =550+k. 
grams. This means that we can tell immediately from the weight which barrel contains the 


counterfeit coins. If the weight is for example 557, then this means that barrel 7 cointains 
the counterfeit coins. 


2. Tilings 


Vaguely speaking, an invariant is a quantity that remains the same after certain opera- 
tions. For many problems it is useful to identify such invariants. 


EXAMPLE 11.4. ** We cut out two opposite corner fields of a chessboard. Is it possible 
to put 31 domino tiles (of size 2 x 1) on the remaining 62 fields of the chessboard? 


* 
ed 


- Sam. 


PROOF. The answer is no. The two corner which were cut out have the same color. 
Without loss of generality we may assume that they both were black. Let J be the number 
white fields that have been covered by domino tiles minus the number of black fields that 
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have been covered by domino tiles. If there are no domino tiles on the chessboard, then 
I = 32—30 = 2. Every time we put another domino tile on the chessboard, then this domino 
tile will cover exactly one black field and one white field. This means that the quantity I 
doesn’t change. If the whole chessboard without the two corners would be covered with 
domino tiles, then J = 0 which is impossible because J is constant and equal to 2. 


EXAMPLE 11.5. Suppose that p, g and n are positive integers such that n is not divisible 
by p or by q. Prove that an n x n floor cannot be tiled by p x p or by q x q tiles. 


PRooF. The idea of the proof is the following: Number the rows and columns with 
0,1,2,...,2—1. Let us write numbers a;; on square i,7 such that: (1) The sum of all the 
numbers on a p X p or q X q square is always 0 and (2) the sum of all the numbers a;,;, 
1 <i,j <n is nonzero. This then would clearly prove that the n x n floor cannot be tiled 
with p x p and q x q tiles. 

To ensure that the sum of all the numbers under a p x p square is 0, we could force that 


Gig Oa f hee: Pag psi 0 


for all 7,7. To ensure that the sum of all the numbers under a q x g square is 0, we could 
force that 


Gig + Aig+1 T° + Gig+g-1 = 0 


for all i, 7. The easiest way to choose the a;,; in this fashion is to use complex numbers (but 
one could avoid this). 
For any integer k, let ¢, = e 


14644-4074 = (1-¢)/0-¢) 


is equal to 0 if and only if / is divisible by k. Fill the n x n = n? fields with complex numbers 
as follows. Number the rows and columns by 0,1,2,...,—1. Put the complex number ¢}¢/ 
on the field in row 7 and column j. The sum of all numbers over all fields is 


Ye GG SOP Gere la Gang) 


O<i,j<n 


2ri/k be the k-th primitive root of unity. Observe that 


is nonzero since p and q do not divide n. On the other hand, if we place a p x p tile (with 
one corner at (k,/)), then the sum of all complex numbers under the tile is 


k+p—11l+p—1 p-1 p-1 
pe ee Gul rene Bene! 
i=k j=l i=0 i=0 
since 
p-1 
pet 
i=0 


Similarly all the numbers under a q x q tile sum up to 0. This shows that it is not possible 
to tile the n x n floor with p x p and q x q tiles. 
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3. Exercises 


EXERCISE 11.1. * Suppose that we have 3” coins that look identical. The coins are 
all the same except that one coin is counterfeit and heavier than the others. How can one 
determine, in n weighings on a balance scale, which of the coins is counterfeit? 


EXERCISE 11.2. **** Suppose that we have 12 coins that look identical. The coins are 
all the same except that one coin is counterfeit and does not have the same weight as the 
real coins. We do not now if the counterfeit coin is heavier or lighter than the real coins. 
How can one determine, in 3 weighings on a balance scale, which of the coins is counterfeit 
and whether the counterfeit coin is heavier or lighter? 


EXERCISE 11.3. * Suppose that we have 14 coins that look identical. The coins are all 
the same except that one coin is counterfeit and does not have the same weight as the real 
coins. We do not know if the counterfeit coin is heavier or lighter than the real coins. Show 
that it is not always possible to determine, in 3 weighings on a balance scale, which coins is 
counterfeit and whether it is heavier or lighter at the same time. 


EXERCISE 11.4. **** Suppose that we have a balance scale and exactly 5 weights, weigh- 
ing exactly 71, %2, 43, v4 and x5 grams. For any positive integer n < 100 one would like to 
be able to determine using the scale whether a given object weighs less than, more than or 
exactly n grams. How should one choose the weights 71, 72, #3, %4, 75 such that this is always 
possible? 


EXERCISE 11.5. *** Suppose you have 10 barrel of coins. Each barrel contains all real 
coins or it contains all fake coins. The real coins weigh 10 grams, and the counterfeit coins 
weigh 11 grams. This time, there may be several barrels with cointerfeit coins (or even all 
or none of them). Determine, with only one weighing on a numerical scale, exactly which of 
barrels cointain the cointerfeit coins. (One may assume that the barrels cointain “enough” 
coins.) 


EXERCISE 11.6. *** Show that 12 x 11 rectangular floor cannot be covered using only 
tiles of the following shapes: 


fa fei sad 


EXERCISE 11.7. ***** Suppose that p, g and r are distinct prime numbers and N > 2pqr. 
Show that an N x N floor can be tiled with p x p, q x q and r x r tiles. (Hint: Write 
N = apq+ bpr + cqr for certain nonnegative integers a,b,c. Use this to divide the N x N 
floor in regions which are easy to tile.) 


EXERCISE 11.8. ***** A rectangle R is divided into smaller rectangles. Each of the 
smaller rectangles has at least one side, whose length is an integer. Show that R itself has 
at least one side which is an integer. 


EXERCISE 11.9. ** You are at the coordinates (1,0,0) in R® where we use the usual ryz 
coordinate axis. A three dimensional knight jump is if you move +1 along one coordinate 
axis, +2 along a second coordinate axis and +3 along the third coordinate axis. For example 
one could jump from (1,0,0) to (1,0,0) + (2, -1,3) = (8,—1,3). Then one could jump to 
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(3, —1,3) + (—3, 2, -1) = (0,1,2) and from there to (0,1, 2) + (1,2, -3) = (1,3, -1). Show 
that it is impossible to land at (0,0,0) after finitely many three dimensional knight jumps. 


4. Games 


EXERCISE 11.10. ** There are 25 matches on the table. Two players take turns. Each 
turn they have to take away 1,2 or 3 matches. The person taking the last match loses. Show 
that the second player always can win this game. (Try it first with 5,9 and 13 matches 


TTT 


EXERCISE 11.11 (IMO). ***** To each vertex of a regular pentagon an integer is assigned 
in such a way that the sum of all five numbers is positive. If three consecutive vertices are 
assigned the numbers x, y, z respectively and y < 0 then the following operation is allowed: 
the numbers x,y,z are replaced by x + y,—y,z + y respectively. Such an operation is 
performed repeatedly as long as at least one of the five numbers is negative. Determine 
whether this procedure necessarily comes to an end after a finite number of steps. 


EXERCISE 11.12 (after a well-known puzzle). **** In a 4 x 4 square, we put the numbers 


2,1,3,4,5,6,...,15 (see below). The last square is black. In each move, we may exchange 
the black square with one of its neighbors (neighbor means sharing an edge). Is it possible 
to get 1,2,3,...,15 after finitely many moves (see second picture). 
2/1/3/4 1/2 )3 |4 
G/F ks MOe Wr IS 
9 | 10) 11) 12 9 | 10) 11) 12 
13) 14) 15 fel 13/14) 15 lol 


EXERCISE 11.13. Suppose that we have an m x n chocolate bar. We break the chocolate 
bar into two pieces. Then we take one piece and break it into two. We keep repeating this 
until we are only left with mn pieces of size 1 x 1. How many times do we have to break the 
chocolate bar? Prove your formula. (In particular, show that the number of breaks needed 
does not depend on how you proceed.) 


EXERCISE 11.14. **** Define a sequence 21,22, 23,... by 41 = 1, x2 = 5 and 
2 
TE ane ai! 
=> = 
a Wn 


for n > 2. What is lim,_..5 2p? 
EXERCISE 11.15. *** We start with the numbers 


1,2,3,4,5,6,7,8,9, 10 
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Then we replace two numbers, say x and y, by ry/(x + y). We repeat this until there is 
only one number left. Show that, regardless how you do it, this number is always equal to 
2520/7381. (For example, we could replace 3 and 6 by 3-6/(3 + 6) = 2 to get the sequence 


1,2,4,5,7,8, 9, 10, 2. 

Then we can replace 9 and 10 by 9- 10/(9 + 10) = 90/19 to get the sequence 
1,2,4,5, 7,8, 2, 90/19. 

etc.) 
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